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SEQUENCE LISTING 



10 



55 



60 



65 



<110> KUEHN, Ralf 

FELDER, Susanne 
SCHWENK, Frieder 
KUETER-LUKS, Birgit 
FAUST, Nicole 

<120> Modified Recombinase 

<130> 012787wo/JH/ml 



15 <140> 
<141> 

<160> 108 

,0 <170> Patentln Ver. 2.1 

I <210> 1 

<211> 86 

<212> DNA 

5 <213> Artificial Sequence 

i 

<220> . 

: <223> Description of Artificial Sequence: primer C31-4 

0 <400> 1 

cgtgacggtc tcgaagccgc ggtgcgggtg ccagggcgtg cccttgggct ccccgggcgc 60 
gtactccacc tcacccatct ggtcca 86 

5 <210> 2 
■ <211> 86 
<212> DNA 

<213> Artificial Sequence 
40 <220> 

<223> Description of Artificial Sequence: primer C31-5 
<400> 2 

cgtggaccag atgggtgagg tggagtacgc gcccggggag cccaagggca cgccctggcc 60 
45 cacgcaccgc ggcttcgaga ccgtca " qq 

<210> 3 
<211> 90 
50 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer C31-6 
<400> 3 

gatccagaag cggttttcgg gagtagtgcc ccaactgggg taacctttga gttctctcag 60 
ttgggggcgt agggtcgccg acatgacacg 90 

<210> 4 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 



44 



<223> Description of Artificial Sequence: primer C31-7-2 

<400> 4 

gatccgtgtc atgtcggcga ccctacgccc ccaactgaga gaactcaaag gttaccccag 60 
ttggggcact actcccgaaa accgcttctg 90 



<210> 5 
<211> 7438 
10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector pRK64 
<400> 5 

cgtcatcacc gaaacgcgcg aggcagctgt ggaatgtgtg tcagttaggg tgtggaaagt 60 
, ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 120 
ggctccccag caggcagaag tgtgcaaagc atgcatctca attagtcagc aaccatagtc 180 
10 ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 240 
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctaggaacag 300 

! tcgacgacac tgcagagacc tacttcacta acaaccggta cagttcgtgg accagatggg 360 

tgaggtggag tacgcgcccg gggagcccaa gggcacgccc tggcacccgc accgcggctt 420 
cgagaccgtc acgaataact tcgtatagca tacattatac gaagttataa gcttgcatgc 480 

25 ctgcaggtcg gccgccacga ccggccggcc ggtgccgcca ccatcccctg acccacgccc 540 

, ctgacccctc acaaggagac gaccttccat gaccgagtac aagcccacgg tgcgcctcgc 600 

cacccgcgac gacgtccccc gggccgtacg caccctcgcc gccgcgttcg ccgactaccc 660 
cgccacgcgc cacaccgtcg acccggaccg ccacatcgag cgggtcaccg agctgcaaga 720 

j actcttcctc acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg acgacggcgc 780 

30 cgcggtggcg gtctggacca cgccggagag cgtcgaagcg ggggcggtgt tcgccgagat 84 0 
cggcccgcgc atggccgagt tgagcggttc ccggctggcc gcgcagcaac agatggaagg 900 

I cctcctggcg ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg tcggcgtctc 960 

j gcccgaccac cagggcaagg gtctgggcag cgccgtcgtg ctcc.ccggag tggaggcggc 1020 

cgagcgcgcc ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc ' tccccttcta 1080 

|3 5 cgagcggctc ggcttcaccg tcaccgccga cgtcgagtgc ccgaaggacc gcgcgacctg 1140 

\ gtgcatgacc cgcaagcccg gtgcctgacg cccgccccac gacccgcagc gcccgaccga 1200 

aaggagcgca cgaccccatg gctccgaccg aagccgaccc gggcggcccc gccgaccccg 1260 

: cacccgcccc cgaggcccac cgactctaga ggatcataat cagccatacc acatttgtag 1320 

aggttttact tgctttaaaa aacctcccac acctccccct gaacctgaaa cataaaatga 1380 

40 atgcaattgt tgttgttaac ttgtttattg cagcttataa tggttacaaa taaagcaata 1440 
gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca 1500 
aactcatcaa tgtatcttat catgtctgga tccgtgtcat gtcggcgacc ctacgccccc 1560 
aactgagaga actcaaaggt taccccagtt ggggcactac tcccgaa'aac cgcttctgga 1620 
tccataactt cgtatagcat acattatacg aagttatacc gggccaccat ggtcgcgagt 1680 

45 agcttggcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 1740 
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 1800 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgctt tgcctggttt 18 60 
ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg atcttcctga ggccgatact 1920 
gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg cgcccatcta caccaacgta 1980 

50 acctatccca ttacggtcaa tccgccgttt gttcccacgg agaatccgac gggttgttac 204 0 
tcgctcacat ttaatgttga tgaaagctgg ctacaggaag gccagacgcg aattattttt 2100 
gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc gctgggtcgg ttacggccag 2160 
gacagtcgtt tgccgtctga atttgacctg agcgcatttt tacgcgccgg agaaaaccgc 2220 
ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc tggaagatca ggatatgtgg 2280 

55 cggatgagcg gcattttccg tgacgtctcg ttgctgcata aaccgactac acaaatcagc 2340 
gatttccatg ttgccactcg ctttaatgat gatttcagcc gcgctgtact ggaggctgaa 2400 
gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa cagtttcttt atggcagggt 24 60 
gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg aaattatcga tgagcgtggt 2520 
ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa acccgaaact gtggagcgcc 2580 

60 gaaatcccga atctctatcg tgcggtggtt gaactgcaca ccgccgacgg cacgctgatt 264 0 
gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga ttgaaaatgg tctgctgctg 2700 
ctgaacggca agccgttgct gattcgaggc gttaaccgtc acgagcatca tcctctgcat 2760 
ggtcaggtca tggatgagca gacgatggtg caggatatcc tgctgatgaa gcagaacaac 2820 
tttaacgccg tgcgctgttc gcattatccg aaccatccgc tgtggtacac gctgtgcgac 2880 

65 ■ cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa cccacggcat ggtgccaatg 2940 
aatcgtctga ccgatgatcc gcgctggcta ccggcgatga gcgaacgcgt aacgcgaatg 3000 
gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt cgctggggaa tgaatcaggc 3060 




cacggcgcta atcacgacgc gctgtatcgc 
gtgcagtatg aaggcggcgg agccgacacc 
gcgcgcgtgg atgaagacca gcccttcccg 
ctttcgctac ctggagagac gcgcccgctg 
5 aacagtcttg gcggtttcgc taaatactgg 
ggcggcttcg tctgggactg ggtggatcag 
ccgtggtcgg cttacggcgg tgattttggc 
aacggtctgg tctttgccga ccgcacgccg 
cagcagtttt tccagttccg tttatccggg 
10 ttccgtcata gcgataacga gctcctgcac 
gcaagcggtg aagtgcctet ggatgtcgct 
gaactaccgc agccggagag cgccgggcaa 
aacgcgaccg catggtcaga agccgggcac 
gaaaacctca gtgtgacgct ccccgccgcg 
15 gaaatggatt tttgcatcga gctgggtaat 
tttctttcac agatgtggat tggcgataaa 
ttcacccgtg caccgctgga taacgacatt 
aacgcctggg tegaacgctg gaaggcggcg 
cagtgcacgg cagatacact tgctgatgcg 
! catcagggga aaaccttatt tatcagccgg 
j atggcgatta ccgttgatgt tgaagtggcg 
! ctgaactgcc agctggcgca ggtagcagag 
gaaaactatc ccgaccgcct tactgccgcc 
[ gacatgtata ccccgtacgt cttcccgagc 
> ttgaattatg gcccacacca gtggcgcggc 
! caacagcaac tgatggaaac cagccatcgc 
ctgaatatcg acggtttcca tatggggatt 
tcggcggaat tccagctgag cgccggtcgc 
taataataac cgggcagggg ggatctttgt 
0 attggacaaa ctacctacag agatttaaag 
ataatgtgtt aaactactga ttctaattgt 
tgatgaatgg gagcagtggt ggaatgccag 
ttggacaaac cacaactaga atgcagtgaa 
! ctattgcttt atttgtaacc attataagct 
5 ttcattttat gtttcaggtt cagggggagg 
I tctacaaatg tggtatggct gattatgatc 
! ttttataggt taatgtcatg ataataatgg 
gaaatgtgcg cggaacccct atttgtttat 
tcatgagaca ataaccctga taaatgcttc 
40 ttcaacattt ccgtgtcgcc cttattccct 
ctcaccbaga aacgctggtg aaagtaaaag 
gttacatcga actggatctc aacagcggta 
gttttccaat gatgagcact tttaaagttc 
acgccgggca agagcaactc ggtcgccgca 
45 actcaccagt cacagaaaag catcttacgg 
ctgccataac catgagtgat aacactgcgg 
cgaaggagct aaccgctttt ttgcacaaca 
gggaaccgga gctgaatgaa gccataccaa 
caatggcaac aacgttgcgc aaactattaa 
50 aacaattaat agactggatg gaggcggata 
ttccggctgg ctggtttatt gctgataaat 
tcattgcagc actggggcca gatggtaagc 
ggagtcaggc aactatggat gaacgaaata 
ttaagcattg gtaactgtca gaccaagttt 
55 ttcattttta atttaaaagg atctaggtga 
tcccttaacg tgagttttcg ttccactgag 
cttcttgaga tccttttttt ctgcgcgtaa 
taccagcggt ggtttgtttg ccggatcaag 
gcttcagcag agcgcagata ccaaatactg 
60 acttcaagaa ctctgtagca ccgcctacat 
ctgctgccag tggcgataag tcgtgtctta 
ataaggcgca gcggtcgggc tgaacggggg 
cgacctacac cgaactgaga tacctacagc 
aagggagaaa ggcggacagg tatccggtaa 
65. gggagcttcc agggggaaac gcctggtatc 
gacttgageg tcgatttttg tgatgctcgt 
gcaacgcggc ctttttacgg ttcctggcct 




tggatcaaat ctgtcgatcc ttcccgcccg 3120 
acggccaccg atattatttg cccgatgtac 3180 
gctgtgccga aatggtccat caaaaaatgg 3240 
atcctttgcg aatacgccca cgcgatgggt 3300 
caggcgtttc gtcagtatcc ccgtttacag 3360 
tcgctgatta aatatgatga aaacggcaac 3420 
gatacgccga acgatcgcca gttctgtatg 3480 
catccagcgc tgacggaagc aaaacaccag 354 0 
caaaccatcg aagtgaccag cgaatacctg 3600 
tggatggtgg cgctggatgg taagccgctg 3660 . 
ecacaaggta aacagttgat tgaactgcct 3720 
ctctggctca cagtacgcgt agtgcaaccg 3780 
atcagcgcct ggcagcagtg gcgtctggcg 384.0 . 
tcccacgcca tcccgcatct gaccaccagc 3900 
aagcgttggc aatttaaccg ccagtcaggc 3960 
aaacaactgc tgacgccgct gcgcgatcag 4020 
ggcgtaagtg aagegacccg cattgaccct 4080 
ggGcattacc aggccgaagc agcgttgttg 414 0_ 
gtgctgatta cgaccgctca cgcgtggcag 4200 
aaaacctacc ggattgatgg tagtggtcaa 4260 
agcgatacac cgcatccggc gcggattggc 4320 
cgggtaaact ggctcggatt agggccgcaa 4380 
tgttttgacc gctgggatct gccattgtca 4 4 40 
gaaaacggtc tgcgctgcgg gacgcgcgaa 4500 
gacttccagt tcaacatcag ccgctacagt 4560 
catctgctgc acgcggaaga aggcacatgg 4 620 
ggtggcgacg actcctggag cccgtcagta 4 680 
taccattacc agttggtctg gtgtcaaaaa 4740 
gaaggaacct tacttctgtg gtgtgacata 4800 
ctctaaggta aatataaaat ttttaagtgt 4 860 
ttgtgtattt tagattccaa cctatggaac 4920 
atccagacat gataagatac attgatgagt 4 980 
aaaaatgctt tatttgtgaa atttgtgatg 5040 
gcaataaaca agttaacaac aacaattgca 5100 
tgtgggaggt tttttaaagc aagtaaaacc 5160 
tgcggccgca gggcctcgtg atacgcctat 5220 
tttcttagac gtcaggtggc acttttcggg 5280 
ttttctaaat acattcaaat atgtatccgc 5340 
aataatattg aaaaaggaag agtatgagta 54 00 
tttttgcggc attttgcctt cctgtttttg 54 60 
atgctgaaga tcagttgggt gcacgagtgg 5520 
agatccttga gagttttcgc cccgaagaac 5580 
tgctatgtgg cgcggtatta tcccgtattg 5640 
tacactattc tcagaatgac ttggttgagt 5700 
atggcatgac agtaagagaa ttatgcagtg 5760 
ccaacttact tctgacaacg atcggaggac 5820 
tgggggatca tgtaactcgc cttgatcgtt 5880 
acgacgagcg tgacaccacg atgcctgtag 5940 
ctggcgaact acttactcta gcttcccggc 6000 
aagttgcagg accacttctg cgctcggccc 6060 
ctggagccgg tgagcgtggg tctcgcggta 6120 
cctcccgtat cgtagttatc tacacgacgg 6180 
gacagatcgc tgagataggt gcctcactga 624 0 
actcatatat actttagatt gatttaaaac 6300 
agatcctttt tgataatctc atgaccaaaa 6360 
cgtcagaccc cgtagaaaag atcaaaggat 6420 
tctgctgctt gcaaacaaaa aaaccaccgc 6480 
agctaccaac tctttttccg aaggtaactg 6540 
tccttctagt gtagccgtag ttaggccacc 6600 
acctcgctct gctaatcctg ttaccagtgg 6660 
ccgggttgga ctcaagacga tagttaccgg 6720 
gttcgtgcac acagcccagc ttggagcgaa 6780 
gtgagctatg agaaagcgcc acgcttcccg 6840 
gcggcagggt cggaacagga gagcgcacga 6900 
tttatagtcc tgtcgggttt cgccacctct 6960 
caggggggcg gagcctatgg aaaaacgcca 7020 
tttgctggcc ttttgctcac atgttctttc 7080 
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ctgcgttatc 
ctcgccgcag 

caatacgcaa 
ggtttcccga 
attaggcacc 
gcggataaca 



ccctgattct gtggataacc 
ccgaacgacc gagcgcagcg 
accgcctctc cccgcgcgtt 
ctggaaagcg ggcagtgagc 
ccaggcttta cactttatgc 
atttcacaca ggaaacagct 



gtattaccgc ctttgagtga gctgataccg 7140 

agtcagtgag cgaggaagcg gaagagcgcc 7200 

ggccgattca ttaatgcagc tggcacgaca 7260 

gcaacgcaat taatgtgagt tagctcactc 7320 

ttccggctcg tatgttgtgt ggaattgtga 7380 

atgaccatga ttacgccaag ctggcgcg 7438 



<210> 6 
10 <211> 45 
<212> DNA 

<213> Artificial Sequence ... 
<220> 

15 <223> Description of Artificial Sequence: primer C31-1 
<400> 6 

ataagaatgc ggccgcccga tatgacacaa ggggttgtga ccggg 45 

i 

JO 

; <210> 7 
! <211> 40 
J <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer C31-3 
<400> 7 

30 ataagaatgc ggccgcatcc gccgctacgt cttccgtgcc 40 

j <210> 8 

! <211> 24 

135 <212> DNA 

! <213> Artificial Sequence 



25 



40 



<220> 

<223> .Description of Artificial Sequence: primer C31-8 
<400> 8 

cccgttggca ggaagcactt ccgg 24 



45 <210> 9 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: primer C31-9 



55 



<400> 9 

ggatcctcga gccgcgggcg gccgcctacg ccgctacgtc ttccgtgccg tcctg 55 



<210> 10 
<211> 5711 

<212> DNA ; 
60 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
pCMV-C31-Int (wt) 

65 

<400> 10 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 
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10 



15 



50 



35 



40 



45 



50 



55 



60 



65 



tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
tgagtactcc 
cgaggaggat 
ctggtcagaa 
gccatacact 
cagggcggcc 
cttacgaccg 
gtagcgccaa 
ggttcaggtt 
agcgcccgga 
ttgtctatga 
cggaattgct 
gaaacgtcat 
cgctgaagtc 
tcggcgggaa 
acggccgaat 
gacccttcga 
aacaccttcc 
tttgtaagcg 
ccgcttcaag 
cgggcttcgc 
ttgagggtta 
gaccgatcat 
gcggcaaggg 
agtgtggcgc 
gccgtcgccg 
tcagcatggc 
ccgaaggcga 
tcactgaggc 
ccctgaacgc 
ttggcaggaa 
aagagcggct 
tccccgaaga 
tagacgacaa 
ctacgggcag 
cgccgaccga 
cggcgcccgg 
tgtgtgttgg 
ggggaggggg 
ggggaggcca . 
tacaacgtcg 
cccctttcgc 
tgcgcagcct 
gtatttcaca 
gccagccccg 
catccgctta 
cgtcatcacc 
atgtcatgat 
gaacccctat 
aaccctgata 
gtgtcgccct 
cgctggtgaa 
tggatctcaa 
tgagcacttt 
agcaactcgg 
cagaaaagca 
tgagtgataa 



ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
ctctcaaaag 
ttgatattca 
aagacaatct 
tgagtgacat 
gcccgatatg 
tcagtcgcgc 
cgaagacaag 
cgtcgggcat 
gttcgaacgc 
cgtgtcgcgc 
cgccctgggc 
ggacctgatt 
ggcgaagatt 
ggcgccttac 
ggtcaatgtc 
gttcgagccc 
cttcaagccg 
catggacgct 
cgcctgggac 
cgctgaggtg 
ccgcattcag 
cgagcccgct 
gctttcccgg 
cgtcatgact 
gaaggtggtc 
ggcactcgac 
cgaagagacg 
gcctgagaag 
ccttgaagag 
gcacttccgg 
tgccgaactt 
cgccgacgct 
gcgcgtgttc 
ggggcaggga 
cgacgacgaa 
gctcgagatc 
ttttttgtgt 
aggccagaat 
gaatgaggcg 
tgactgggaa 
cagctggcgt 
gaatggcgaa 
ccgcatatgg 
acacccgcca 
cagacaagct 
gaaacgcgcg 
aataatggtt 
ttgtttattt 
aatgcttcaa 
tattcccttt 
agtaaaagat 
cagcggtaag 
taaagttctg 
tcgccgcata 
tcttacggat 
cactgcggcc 



attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
cgggcatgac 
cctggcccgc 
ttttgttgtc 
tgacatccac 
acacaagggg 
gagcgcgaga 
gcggccgacc 
ttcagcgaag 
atcctgaacg 
ttctcgegcc 
gtgacgattg 
cacctgatta 
ctcgacacga 
ggcttcgagc 
gtcatcaaca 
gacgtaatcc 
ggcagtcaag 
gacgccgtgc 
ccggcaaccg 
atctacaaga 
cgcgacccga 
gagtggtatg 
gggcaagcca 
tcgaagcgcg 
gacccgtccg 
aagttcgttg 
ttggcgcttc 
agcggcgaac 
ctgtacgaag 
aagcaacagg 
gaagccgccg 
gacccgaccg 
gtcgggctct 
acgcccatcg 
gacgacgccc 
caggcgcgga 
gccttggggg 
gaccttgggg 
cgcccccggg 
aaccctggcg 
aatagcgaag 
tggcgcctga 
tgcactctca 
acacccgctg 
gtgaccgtct 
agacgaaagg 
tcttagacgt 
ttctaaatac 
taatattgaa 
tttgcggcat 
gctgaagatc 
atccttgaga 
ctatgtggcg 
cactattctc 
ggcatgacag 
aacttacttc 



agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctgactcta 
ttctgcgcta 
ggtgatgcct 
aagcttgagg 
tttgcctttc 
ttgtgaccgg 
attcgagcgc 
ttcagcgcga 
cgccgggcac 
aatgccgcgc 
tgaaggtcat 
tttccactca 
tgcggctcga 
agaaccttca 
ttgtttcgga 
agcttgcgca 
ggtggtggtg 
ccgccattca 
cgacccgggg 
ttatgcgaat 
agaagccgga 
tcacgctccg 
agcttcaggc 
ttctgtccgc 
gggaagaatc 
cacctgggca 
cggaacgcat 
tgtgggaagc 
gggcgaacct 
accgcgcggc 
cagcgctgac 
aagccccgaa 
gccctaagtc 
tcgtagacaa 
agaagcgcgc 
aggacggcac 
tcaataaaag 

agggggaggc 

gagggggagg 
taccgagctc 
ttacccaact 
aggcccgcac 
tgcggtattt 
gtacaatctg 
acgcgccctg 
ccgggagctg 
gcctcgtgat 
caggtggcac 
attcaaatat 
aaaggaagag 
tttgccttcc 
agttgggtgc 
gttttcgccc 
cggtattatc 
agaatgactt 
taagagaatt 
tgacaacgat 



tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 
ctgcttactg 
gacttaatta 
agattgtcag 
ttgagggtgg 
tgtggcaggc 
tctecacagg 
ggtggaeacg 
agcaagccca 
agtcgagcgc 
gtcggcgttc 
cgggcggctc 
ggacgcgatt 
ggaaggcgtc 
cgcgtcgcac 
gcgcgaattg 
gacgaaggag 
ctcgaccact 
gcgtgagatc 
cccgggcagc 
cgagacgatt 
ccttcgggac 
cggcacgccg 
gccggtcgag 
gtggttggac 
catggacaag 
gatcaaggac 
gcacgaaggc 
cttcaacaag 
cgcccgacgc 
tgttgcggag 
aggcgcgtac 
gctccggcag 
gcttcccctt 
gtggtggggg 
gatcgttgtc 
ttcgatcacg 
ggaagacgta 
atcattattt 
cagaatgagg 
ccagaatgac 
gaattcactg 
taatcgcctt 
cgatcgccct 
tctccttacg 
ctctgatgcc 
acgggcttgt 
catgtgtcag 
acgcctattt 
ttttcgggga 
gtatccgctc 
tatgagtatt 
tgtttttgct 
acgagtgggt 
cgaagaacgt 
ccgtattgac 
ggttgagtac 
atgcagtgct 
cggaggaccg 



cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
agcgttgggg 
tttccaaaaa 
ccgcgtccat 
ttgagatctg 
tgtccactcc 
tacgcgggtg 
gcgacacagc 
gacgggggcc 
gggacggcgg 
aacatgatca 
ccgattgtct 
ttccggcagg 
aaagaatctt 
ggcgggtacg 
atcacgcgca 
ccccttaccg 
aagacgcaca 
atcacggggc 
gggaagaaga 
ccgcgtattg 
accacgaaga 
cttgattgcg 
ggcagggggc 
ctgtactgcg 
tcttaccgct 
acgtgcaacg 
atcaggcacg 
ttcggcaagc 
cgcgccgacg 
gacggacccg 
caaggggcgg 
gaccaatggt 
cgcgcgtcag 
acgaagtcga 
tgggcgaagc 
gcggcgtagg 
tcaatagatc 
cgcggccaag 
cttgggggag 
gccgtcgttt 
gcagcacatc 
tcccaacagt 
catctgtgcg 
gcatagttaa 
ctgctcccgg 
aggttttcac 
ttataggtta 
aatgtgcgcg 
at'gagacaat 
caacatttcc 
cacccagaaa 
tacatcgaac 
tttccaatga 
gccgggcaag 
tcaccagtca 
gccataacca 
aaggagctaa 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

114 0 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 
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ccgctttttt gcacaacatg ggggatcatg 
tgaatgaagc cataccaaac gacgagcgtg 
cgttgcgcaa actattaact ggcgaactac 
actggatgga ggcggataaa gttgcaggac 
ggtttattgc tgataaatct ggagccggtg 
tggggccaga tggtaagccc tcccgtatcg 
ctatggatga acgaaataga cagatcgctg 
aactgtcaga ccaagtttac tcatatatac 
ttaaaaggat ctaggtgaag atcctttttg 
agttttcgtt ccactgagcg tcagaccccg 
ctttttttct gcgcgtaatc tgctgcttgc 
tttgtttgcc ggatcaagag ctaccaactc 
cgcagatacc aaatactgtc cttctagtgt 
ctgtagcacc gcctacatac ctcgctctgc 
gcgataagtc gtgtcttacc gggttggact 
ggtcgggctg aacggggggt tcgtgcacac 
aactgagata cctacagcgt gagctatgag 
cggacaggta tccggtaagc ggcagggtcg 
ggggaaacgc ctggtatctt tatagtcctg 
gatttttgtg atgctcgtca ggggggcgga 
ttttacggtt cctggccttt -tgctggcctt 
ctgattctgt ggataaccgt attaccgcct 
gaaegaccga gcgcagcgag tcagtgagcg 
cgcctctccc cgcgcgttgg ccgattcatt 
ggaaagcggg cagtgagcgc aacgcaatta 
aggctttaca ctttatgctt ccggctcgta 
ttcacacagg aaacagctat gaccatgatt 
cctgcaggtt t 



taactcgcct tgatcgttgg gaaecggagc 4140 
acaccacgat gcctgtagca atggcaacaa 4200 
ttactctagc ttcccggcaa caattaatag 4260 
cacttctgcg ctcggccctt ccggctggct 4320 
agcgtgggtc tcgcggtatc attgcagcac 4380 
tagttatcta cacgacgggg agtcaggcaa 4440 
agataggtgc ctcactgatt aagcattggt 4500 
tttagattga tttaaaactt catttttaat 4560 
ataatctcat gaccaaaatc ccttaacgtg 4620 
tagaaaagat caaaggatct tcttgagatc 4680 
aaacaaaaaa accaccgcta ccagcggtgg 4740 
tttttccgaa ggtaactggc ttcagcagag 4800 
agccgtagtt aggccaccac ttcaagaact 4860 
taatcctgtt accagtggct gctgccagtg 4 920 
caagacgata gttaccggat aaggcgcagc 4980 . 
agcccagctt ggagcgaacg acctacaccg 504 0 
aaagcgccac gcttcccgaa gggagaaagg 5100 
gaacaggaga gcgcacgagg gagcttccag 5160 
tcgggtttcg ccacctctga cttgagcgtc 5220 
gcctatggaa aaacgccagc aacgcggcct 5280 
ttgctcacat gttctttcct gcgttatccc 534 0 
ttgagtgagc tgataccgct cgccgcagcc 5400 
aggaagcgga agagcgccca atacgcaaac 5460 
aatgcagctg gcacgacagg tttcccgact 5520 
atgtgagtta gctcactcat taggcacccc 5580 
tgttgtgtgg aattgtgagc ggataacaat 5640 
acgccaagct agcccgggct agcttgcatg 5700 

5711 



<210> 11 
<211> 69 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer C31-2-2 
<400> 11 

tagaattccg ctcgagagtc taaaccttcc tcttcttctt aggcgccgct acgtcttccq 60 
tgccgtcct ~ 69 



<210> 12 
<211> 5723 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
pCMV-C31-Int (CNLS) 

<400> 12 

cctgcaggtt taaacagtcc gatgtacggg 
ctagttatta atagtaatca attacggggt 
gcgttacata acttacggta aatggcccgc 
tgacgtcaat aatgacgtat gttcccatag 
aatgggtgga ctatttacgg taaactgccc 
caagtacgcc ccctattgac gtcaatgacg 
acatgacctt atgggacttt cctacttggc 
ccatggtgat gcggttttgg cagtacatca 
gatttccaag tctccacccc aftgacgtca 
gggactttcc aaaatgtcgt aacaactccg 
tacggtggga ggtctatata agcagagctc 
ggcttatcga aattaatacg actcactata 
aagcgttggg gtgagtactc cctctcaaaa 
gtttccaaaa acgaggagga tttgatattc 



Sequence: vector 



ccagatatac gcgttgacat tgattattga 60 
cattagttca tagcccatat atggagttcc 120 
ctggctgacc gcccaacgac ccccgcccat 180 
taacgccaat agggactttc cattgacgtc 240 
acttggcagt acatcaagtg tatcatatgc 300 
gtaaatggcc cgcctggcat tatgcccagt 360 
agtacatcta cgtattagtc atcgctatta 420 
atgggcgtgg atagcggttt gactcacggg 480 
atgggagttt gttttggcac caaaatcaac 540 
ccccattgac gcaaatgggc ggtaggcgtg 600 
tctggctaac tagagaaccc actgcttact 660 
gggagaccca agctgactct agacttaatt 720 
gcgggcatga cttctgcgct aagattgtca 780 
acctggcccg cggtgatgcc tttgagggtg 840 



gccgcgtcca tctggtcaga aaagacaatc tttttgttgt caagcttgag gtgtggcagg 900 
cttgagatct ggccatacac ttgagtgaca ttgacatcca ctttgccttt ctctccacag 960 
gtgtccactc ccagggcggc cgcccgatat gacacaaggg gttgtgaccg gggtggacac 1020 
gtacgcgggt gcttacgacc gtcagtcgcg cgagcgcgag aattcgagcg cagcaagccc 1080 
5 agcgacacag cgtagcgcca acgaagacaa ggcggccgac cttcagcgcg aagtcgagcg 1140 
cgacgggggc cggttcaggt tcgtcgggca tttcagcgaa gcgccgggca cgtcggcgtt 1200 
cgggacggcg gagcgcccgg agttcgaacg catcctgaac gaatgccgcg ccgggcggct 1260 
caacatgatc attgtctatg acgtgtcgcg cttctcgcgc ctgaaggtca tggacgcgat 1320 
tccgattgtc tcggaattgc tcgccctggg cgtgacgatt gtttccactc aggaaggcgt 1380 
10 cttccggcag ggaaacgtca tggacctgat tcacctgatt atgcggctcg acgcgtcgca 1440 
caaagaatct tcgctgaagt cggcgaagat tctcgacacg aagaaccttc agcgcgaatt 1500 
gggcgggtac gtcggcggga aggcgcctta cggcttcgag cttgtttcgg agacgaagga 1560 
gatcacgcgc aacggccgaa tggtcaatgt cgtcatcaac aagcttgcgc actcgaccac 1620 
tccccttacc ggacccttcg agttcgagcc cgacgtaatc cggtggtggt ggcgtgagat 1680 
15 caagacgcac aaacaccttc ccttcaagcc gggcagtcaa gccgccattc acccgggcag 1740 
catcacgggg ctttgtaagc gcatggacgc tgacgccgtg ccgacccggg gcgagacgat 1800 
tgggaagaag accgcttcaa gcgcctggga cccggcaacc gttatgcgaa tccttcggga 18 60 
cccgcgtatt gcgggcttcg ccgctgaggt gatctacaag aagaagccgg acggcacgcc 1920 
gaccacgaag attgagggtt accgcattca gcgcgacccg atcacgctcc ggccggtcga 1980 
) gcttgattgc ggaccgatca tcgagcccgc tgagtggtat gagcttcagg cgtggttgga 2040 
! cggcaggggg cgcggcaagg ggctttcccg ggggcaagcc attctgtccg ccatggacaa 2100 
j gctgtactgc gagtgtggcg ccgtcatgac ttcgaagcgc ggggaagaat cgatcaagga 2160 
ctcttaccgc tgccgtcgcc ggaaggtggt cgacccgtcc gcacctgggc agcacgaagg 2220 
cacgtgcaac gtcagcatgg cggcactcga caagttcgtt gcggaacgca tcttcaacaa 2280 
5 gatcaggcac gccgaaggcg acgaagagac gttggcgctt ctgtgggaag ccgcccgacg 2340 
cttcggcaag ctcactgagg cgcctgagaa gagcggcgaa cgggcgaacc ttgttgcgga 24 00 
. gcgcgccgac gccctgaacg cccttgaaga gctgtacgaa gaccgcgcgg caggegcgta 2460 
i cgacggaccc gttggcagga agcacttccg gaagcaacag gcagcgctga cgctccggca 2520 
gcaaggggcg gaagagcggc ttgccgaact tgaagccgcc gaagccccga agcttcccct 2580 
l0 tgaccaatgg ttccccgaag acgccgacgc tgacccgacc ggccctaagt cgtggtgggg 2640 
gcgcgcgtca gtagacgaca agcgcgtgtt cgtcgggctc ttcgtagaca agatcgttgt 2700 
; cacgaagtcg actacgggca gggggcaggg aacgcccatc gagaagcgcg cttcgatcac 27 60 
j gtgggcgaag ccgccgaccg acgacgacga agacgacgcc caggacggca cggaagacgt 2820 
. agcggcgcct aagaagaaga ggaaggttta •gactct'cgag atccaggcgc ggatcaataa 2880 
35 aagatcatta ttttcaatag atctgtgtgt tggttttttg tgtgccttgg gggaggggga 2940 
ggccagaatg aggcgcggcc aagggggagg gggaggccag aatgaccttg ggggaggggg 3000 
aggccagaat gaccttgggg gagggggagg ccagaatgag gcgcgccccc gggtaccgag 3060 
ctcgaattca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca 3120 
acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg 3180 
caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgcc tgatgcggta 3240 
ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc tcagtacaat 3300 
ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg , ctgacgcgcc 3360 
ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 3420 
ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa agggcctcgt 3480 
45 gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 3540 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 3600 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 3660 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 3720 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 3780 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 3840 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 3900 
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 3960 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 4020 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 4080 
55 gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 4140 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 4200 
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 4260 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 4320 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 4380 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 4440 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 4500 
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 4560 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatet 4620 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 4680 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 4740 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 4800 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 4860 
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gttaggccac 
gttaccagtg 
atagttaccg 
cttggagcga 
cacgcttccc 
agagcgcacg 
tcgccacctc 
gaaaaacgcc 
catgttcttt 
agctgatacc 
ggaagagcgc 
ctggcacgac 
ttagctcact 
tggaattgtg 
gctagcccgg 



cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 
agggagcttc 
tgacttgagc 
agcaacgcgg 
cctgcgttat 
gctcgccgca 
ccaatacgca 
aggtttcccg 
cattaggcac 
agcggataac 
gctagcttgc 



actctgtagc 
gtggcgataa 
agcggtcggg 
ccgaactgag 
aggcggacag 
cagggggaaa 
gtcgattttt 
cctttttacg 
cccctgattc 
gccgaacgac 
aaccgcctct 
actggaaagc 
cccaggettt 
aatttcacac 
atg 



accgcctaca 
gtcgtgtctt 
ctgaacgggg 
atacctacag 
gtatccggta 
cgcctggtat 
gtgatgctcg 
gttcctggcc 
tgtggataac 
cgagcgcagc 
ccccgcgcgt 
gggcagtgag 
acactttatg 
aggaaacagc 



tacctcgctc 
accgggttgg 
ggttcgtgca 
cgtgagctat 
agcggcaggg 
ctttatagtc 
tcaggggggc 
ttttgctggc 
cgtattaccg 
gagtcagtga 
tggccgattc 
cgcaacgcaa 
cttccggctc 
tatgaccatg 



tgctaatcct 
actcaagacg 
cacagcccag 
gagaaagcgc 
tcggaacagg 
ctgtcgggtt 
ggagcctatg 
cttttgctca 
cctttgagtg 
gcgaggaagc 
attaatgcag 
ttaatgtgag 
gtatgttgtg 
attacgccaa 



4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5723 
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<210> 13 
<211> 4960 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
pCMV-Cre 



vector 



<400> 13 

aaacagtccg 

tagtaatCaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 

tgagtactcc 

cgaggaggat 

ctggtcagaa 

gccatacact 

cagggcggcc 

ccaaaatttg 

ggacatgttc 

ttgccggtcg 

tgaagatgtt 

ccagcaacat 

tgacagcaat 

tgaacgtgca 

catggaaaat 

taacaccctg 

tgacggtggg 

tgtagagaag 

tggtgtagct 

cgcgccatct 

tcatcgattg 

acacagtgcc 

gatcatgcaa 

ggatagtgaa 

taaatgattg 

caataaaaga 

gggggaggcc 

agggggaggc 

accgagctcg 

tacccaactt 



atgtacgggc 

ttacggggtc 

atggcccgcc 

ttcccatagt 

aaactgccca 

tcaatgacgg 

ctacttggca 

agtacatcaa 

ttgacgtcaa 

acaactccgc 

gcagagctct 

ctcactatag 

ctctcaaaag 

ttgatattca 

aagacaatct 

tgagtgacat 

tcgaccatgc 

cctgcattac 

agggatcgcc 

tgggcggcat 

cgcgattatc 

ttgggccagc 

gctgtttcac 

aaacaggctc 

agcgatcgct 

ttacgtatag 

agaatgttaa 

gcacttagcc 

gatgatccga 

gccaccagcc 

atttacggcg 

cgtgtcggag 

gctggtggct 

acaggggcaa 

cagatccact 

tcattatttt 

agaatgaggc 

cagaatgacc 

aattcactgg 

aatcgccttg 



cagatatacg 

attagttcat 

tggctgaccg 

aacgccaata 

cttggcagta 

taaatggccc 

gtacatctac 

tgggcgtgga 

tgggagtttg 

cccattgacg 

ctggctaact 

ggagacccaa 

cgggcatgac 

cctggcccgc 

ttttgttgtc 

tgacatccac 

ccaagaagaa 

cggtcgatgc 

aggcgttttc 

ggtgcaagtt 

ttctatatct 

taaacatgct 

tggttatgcg 

tagcgttcga 

gccaggatat 

ccgaaattgc 

tccatattgg 

tgggggtaac 

ataactacct 

agctatcaac 

ctaaggatga 

ccgcgcgaga 

ggaccaatgt 

tggtgcgcct 

agttctaggg 

caatagatct 

gcggccaagg 

ttgggggagg 

ccgtcgtttt 

cagcacatcc 



cgttgacatt 

agcccatata 

cccaacgacc 

gggactttcc 

catcaagtgt 

gcctggcatt 

gtattagtca 

tagcggtttg 

ttttggcacc 

caaatgggcg 

agagaaccca 

gctgactcta 

ttctgcgcta 

ggtgatgcct 

aagcttgagg 

tttgcctttc 

gaggaaggtg 

aacgagtgat 

tgagcatacc 

gaataaccgg 

tcaggcgcgc 

tcatcgtcgg 

gcggatccga 

acgcactgat 

acgtaatctg 

caggatcagg 

cagaacgaaa 

taaactggtc 

gttttgccgg 

tcgcgccctg 

ctctggtcag 

tatggcccgc 

aaatattgtc 

gctggaagat 

ccgcgtcgac 

gtgtgttggt 

gggaggggga 

gggaggccag 

acaacgtcgt 

ccctttcgcc 



gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggCgtgt 

ctgcttactg 

gacttaatta 

agattgtcag 

ttgagggtgg 

tgtggcaggc 

tctccacagg 

tccaatttac 

gaggttcgca 

tggaaaatgc 

aaatggtttc 

ggtctggcag 

tccgggctgc 

aaagaaaacg 

ttcgaccagg 

gcatttctgg 

gttaaagata 

acgctggtta 

gagcgatgga 

gtcagaaaaa 

gaagggattt 

agatacctgg 

gctggagttt 

atgaactata 

ggcgattagc 

ctcgagatcc 

tttttgtgtg 

ggccagaatg 

aatgaggcgc 

gactgggaaa 

agctggcgta 



tagttattaa 

cgttacataa 

gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 

agcgttgggg 

tttccaaaaa 

ccgcgtccat 

ttgagatctg 

tgtccactcc 

tgaccgtaca 

agaacctgat 

ttctgtccgt 

ccgcagaacc 

taaaaactat 

cacgaccaag 

ttgatgccgg 

ttcgttcact 

ggattgctta 

tctcacgtac 

gcaccgcagg 

tttccgtctc 

atggtgttgc 

ttgaagcaac 

cctggtctgg 

caataccgga 

tccgtaacct 

Cattaacgcg 

aggcgcggat 

ccttggggga 

accttggggg 

gcccccgggt 

accctggcgt 

atagcgaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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■ 51 ...... 

ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 24 60 
gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 2520 
tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 2580 
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 2640 
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 2700 
cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc 2760 
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 2820 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 2880 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 2940 
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 3000 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 3060 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 3120 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 3180 
gaatgacttg gttgagtact caecagtcac agaaaagcat cttacggatg gcatgaqagt 3240 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 3300 
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 3360 
aactcgcctt gatcgttggg aaccggagct gaatgaagec ataccaaacg acgagcgtga 3420 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 34 80 
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 3540 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 3600 
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 3660 
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 3720 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 3780 
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 3840 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 3900 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 3960 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4020 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4080 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 414 0 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4200 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4260 
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4320 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4 380 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt ■ atagtcctgt 4440 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4 500 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4560 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4 620 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4 680 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 4740 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 4800 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 4860 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 4 920 
cgccaagcta gcccgggcta gcttgcatgc ctgcaggttt 4 960 

<210> 14 
<211> 3858 
<212> ' DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vector pRK50 
<400> 14 

55 aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 



- _> -_j - ^ ^ ^ v-^^.^ «,w^ i-i.an.aa KJ \J 

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 
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cgaggaggat 

ctggtcagaa 

gccatacact 

cagggcggcc 

atagatctgt 

ggccaagggg 

gggggagggg 

gtcgttttac 

gcacatcccc 

caacagttgc 

ctgtgcggta 

tagttaagcc 

ctcccggcat 

ttttcaccgt 

taggttaatg 

gtgcgcggaa 

agacaataac 

catttccgtg 

ccagaaacgc 

atcgaactgg 

ccaatgatga 

gggcaagagc 

ccagtcacag 

ataaccatga 

gagctaaccg 

ccggagctga 

gcaacaacgt 

ttaatagact 

gctggctggt 

gcagcactgg 

caggcaacta 

cattggtaac 

ttttaattta 

taacgtgagt 

tgagatcctt 

gcggtggttt 

agcagagcgc 

aagaactctg 

gccagtggcg 

gcgcagcggt 

tacaccgaac 

agaaaggcgg 

cttccagggg 

gagcgtcgat 

gcggcctttt 

ttatcccctg 

cgcagccgaa 

cgcaaaccgc 

cccgactgga 

gcaccccagg 

taacaatttc 

ttgcatgcct 



ttgatattca 

aagacaatct 

tgagtgacat 

gcgtcgacct 

gtgttggttt 

gagggggagg 

gaggccagaa 

aacgtcgtga 

ctttcgccag 

gcagcctgaa 

tttcacaccg 

agccccgaca 

ccgcttacag 

catcaccgaa 

tcatgataat 

cccctatttg 

cctgataaat 

tcgcccttat 

tggtgaaagt 

atctcaacag 

gcacttttaa 

aactcggtcg 

aaaagcatct 

gtgataacac 

cttttttgca 

atgaagccat 

tgegcaaact 

ggatggaggc 

ttattgctga 

ggccagatgg 

tggatgaacg 

tgtcagacca 

aaaggatcta 

tttcgttcca 

tttttctgcg 

gtttgccgga 

agataccaaa 

tagcaccgcc 

ataagtcgtg 

cgggctgaac 

tgagatacct 

acaggtatcc 

gaaacgcctg 

ttttgtgatg 

tacggttcct 

attctgtgga 

cgaccgagcg 

ctctccccgc 

aagcgggcag 

ctttacactt 

acacaggaaa 

gcaggttt 



cctggcccgc 
ttttgttgtc 
tgacatccac 
cgagatccag 
tttgtgtgcc 
ccagaatgac 
tgaggcgcgc 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
catatggtgc 
cccgccaaca 
acaagctgtg 
acgcgcgaga 
aatggtttct 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 

ggggggttcg 

acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagtca 
gcgttggccg 
tgagcgcaac 
tatgcttccg 
cagctatgac 



ggtgatgcct 
aagcttgagg 

tttgcctttc 
gcgcggatca 

ttgggggagg 

cttgggggag 
ccccgggtac 
cctggcgtta 
agcgaagagg 
cgcctgatgc 
actctcagta 
cccgctgacg 
accgtctccg 
cgaaagggcc 
tagacgtcag 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
agggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
accgcctttg 
gtgagcgagg 
attcattaat 
gcaattaatg 
gctcgtatgt 
catgattacg 



ttgagggtgg 
tgtggcaggc 
tctccacagg 
ataaaagatc 
gggaggccag 
ggggaggcca 
cgagctcgaa 
cccaacttaa 
cccgcaccga 
ggtattttct 
caatctgctc 
cgccctgacg 
ggagctgcat 
tcgtgatacg 
gtggcacttt 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 
agtgagctga 
aagcggaaga 
gcagctggca 
tgagttagct 
tgtgtggaat 
ccaagctagc 



ccgcgtccat 

ttgagatctg 

tgtccactcc 

attattttca 

aatgaggcgc 

gaatgacctt 

ttcactggcc 

tcgccttgca 

tcgcccttcc 

ccttacgcat 

tgatgccgca 

ggcttgtctg 

gtgtcagagg 

cctattttta 

tcggggaaat 

tccgctcatg 

gagtattcaa 

ttttgctcac 

agtgggttac 

agaacgtttt 

tattgacgcc 

tgagtactca 

cagtgctgcc 

aggaccgaag 

tcgttgggaa 

tgtagcaatg 

ccggcaacaa 

ggcccttccg 

cggtatcatt 

gacggggagt 

actgattaag 

aaaacttcat 

caaaatccct 

aggatcttct 

accgctacca 

aactggcttc 

ccaccacttc 

agtggctgct 

accggataag 

gcgaacgacc 

tcccgaaggg 

cacgagggag 

cctctgactt 

cgccagcaac 

ctttcctgcg 

taccgctcgc 

gcgcccaata 

cgacaggttt 

cactcattag 

tgtgagcgga 

ccgggctagc 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
14 40 
1500 
1560 
1620 
1680 
1740 
1800 
18 60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
33 60 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3858 
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<210> 15 
<211> 6257 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
pRK64 (deltaCre) 

<400> 15 

cgtcatcacc gaaacgcgcg aggcagctgt ggaatgtgtg tcagttaggg tgtggaaagt 60 
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 120 
ggctccccag caggcagaag tgtgcaaagc atgcatctca attagtcagc aaccatagtc 180 
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ccgcccctaa 

catggctgac 

tcgacgacac 

tgaggtggag 

cgagaccgtc 

ggccaccatg 

aaaccctggc 

taatagcgaa 

atggcgcttt 

tcttcctgag 

gcccatctac 

gaatccgacg 

ccagacgcga 

ctgggtcggt 

acgcgccgga 

ggaagatcag 

accgactaca 

cgctgtactg 

agtttcttta 

aattatcgat 

cccgaaactg 

cgccgacggc 

tgaaaatggt 

cgagcatcat 

gctgatgaag 

gtggtacacg 

ccacggcatg 

cgaacgcgta 

getggggaat 

tgtcgatcct 

tattatttgc 

atggtccatc 

atacgcccac 

tcagtatccc 

atatgatgaa 

cgatcgccag 

gacggaagca 

agtgaccagc 

gctggatggt 

acagttgatt 

agtacgcgta 

gcagcagtgg 

cccgcatctg 

atttaaccgc 

gacgccgctg 

agcgacccgc 

ggccgaagca 

gaccgctcac 

gattgatggt 

gcatccggcg 

gctcggatta 

ctgggatctg 

gcgctgcggg 

caacatcagc 

cgcggaagaa 

ctcctggagc 

gttggtctgg 

acttctgtgg 

atataaaatt 

agattccaac 

ataagataca 

atttgtgaaa 

gttaacaaca 

ttttaaagca 

ggcctcgtga 

tcaggtggca 

cattcaaata 



ctccgcccat 
taattttttt 
tgcagagacc 
tacgcgcccg 
acgaatagat 
gtcgcgagta 
gttacccaac 
gaggcccgca 
gcctggtttc 
gccgatactg 
accaacgtaa 
ggttgttact 
attatttttg 
tacggccagg 
gaaaaccgcc 
gatatgtggc 
caaatcagcg 
gaggctgaag 
tggcagggtg 
gagcgtggtg 
tggagcgccg 
acgctgattg 
ctgctgctgc 
cctctgcatg 
cagaacaact 
ctgtgcgacc 
gtgccaatga 
acgcgaatgg 
gaatcaggcc 
tcccgcccgg 
ccgatgtacg 
aaaaaatggc 
gcgatgggta 
cgtttacagg 
aacggcaacc 
ttctgtatga 
aaacaccagc 
gaatacctgt 
aagccgctgg 
gaactgcctg 
gtgcaaccga 
cgtctggcgg 
accaccagcg 
cagtcaggct 
cgcgatcagt 
attgacccta 
gcgttgttgc 
gcgtggcagc 
agtggtcaaa 
cggattggcc 
gggccgcaag 
ccattgtcag 
acgcgcgaat 
cgctacagtc 
ggcacatggc 
ccgtcagtat 
tgtcaaaaat 
tgtgacataa 
tttaagtgta 
ctatggaact 
ttgatgagtt 
tttgtgatgc 
acaattgcat 
agtaaaacct 
tacgcctatt 
cttttcgggg 
tgtatccgct 



cccgccccta 
tatttatgca 
tacttcacta 
gggagcccaa 
ccataacttc 
gcttggcact 
ttaatcgcct 
ccgatcgccc 
cggcaccaga 
tcgtcgtccc 
cctatcccat 
cgctcacatt 
atggcgttaa 
acagtcgttt 
tcgcggtgat 
ggatgagcgg 
atttccatgt 
ttcagatgtg 
aaacgcaggt 
gttatgccga 
aaatcccgaa 
aagcagaagc 
tgaacggcaa 
gtcaggtcat 
ttaacgccgt 
gctacggcct 
atcgtctgac 
tgcagcgcga 
acggcgctaa 
tgcagtatga 
cgcgcgtgga 
tttcgctacc. 
acagtcttgg 
gcggcttcgt 
cgtggtcggc 
acggtctggt 
agcagttttt 
tccgtcatag 
caagcggtga 
aactaccgca 
acgcgaccgc 
aaaacctcag 
aaatggattt 
ttctttcaca 
tcacccgtgc 
acgcctgggt 
agtgcacggc 
atcaggggaa 
tggcgattac 
tgaactgcca 
aaaactatcc 
acatgtatac 
tgaattatgg 
aacagcaact 
tgaatatcga 
cggcggaatt 
aataataacc 
ttggacaaac 
taatgtgtta 
gatgaatggg 
tggacaaacc 
tattgcttta 
tcattttatg 
ctacaaatgt 
tttataggtt 
aaatgtgcgc 
catgagacaa 



actccgccca 
gaggccgagg 
acaaccggta 
gggcacgccc 
gtatagcata 
ggccgtcgtt 
tgcagcacat 
ttcccaacag 
agcggtgccg 
ctcaaactgg 
tacggtcaat 
taatgttgat 
ctcggcgttt 
gccgtctgaa 
ggtgctgcgt 
cattttccgt 
tgccactcgc 
cggcgagttg 
cgccagcggc 
tcgcgtcaca 
tctctatcgt 
ctgcgatgtc 
gccgttgctg 
ggatgagcag 
gcgctgttcg 
gtatgtggtg 
cgatgatccg 
tcgtaatcac 
tcacgacgcg 
aggcggcgga 
tgaagaccag 
. tggagagacg 
cggtttcgct 
ctgggactgg 
ttacggcggt 
ctttgccgac 
ccagttccgt 
cgataacgag 
agtgcctctg 
gccggagagc 
atggtcagaa 
tgtgacgctc 
ttgcatcgag 
gatgtggatt 
accgctggat 
cgaacgctgg 
agatacactt 
aaccttattt 
cgttgatgtt 
gctggcgcag 
cgaccgcctt 
cccgtacgtc 
cccacaccag 
gatggaaacc 
cggtttccat 
ccagctgagc 
gggcaggggg 
tacctacaga 
aactactgat 
agcagtggtg 
acaactagaa 
tttgtaacca 
tttcaggttc 
ggtatggctg 
aatgtcatga 
ggaaccccta 
taaccctgat 



gttccgccca 
ccgcctcggc 
cagttcgtgg 
tggcacccgc 
cattatacga 
ttacaacgtc 
ccccctttcg 
ttgcgcagcc 
gaaagctggc 
cagatgcacg 
ccgccgtttg 
gaaagctggc 
catctgtggt 
tttgacctga 
tggagtgacg 
gacgtctcgt 
tttaatgatg 
cgtgactacc 
accgcgcctt 
ctacgtctga 
gcggtggttg 
ggtttccgcg 
attcgaggcg 
acgatggtgc 
cattatccga 
gatgaagcca 
cgctggctac 
ccgagtgtga 
ctgtatcgct 
gccgacacca 
cccttcccgg 
cgcccgctga 
aaatactggc 
gtggatcagt 
gattttggcg 
cgcacgccgc 
ttatccgggc 
ctcctgcact 
gatgtcgctc 
gccgggcaac 
gccgggcaca 
j.cccgccgcgt 
ctgggtaata 
ggcgataaaa 
aacgacattg 
aaggcggcgg 
gctgatgcgg 
atcagccgga 
gaagtggcga 
gtagcagagc 
actgccgcct 
ttcccgagcg 
tggcgcggcg 
agccatcgcc 
atggggattg 
gccggtcgct 
gatctttgtg 
gatttaaagc 
tctaattgtt 
gaatgccaga 
tgcagtgaaa 
ttataagctg 
agggggaggt 
attatgatct 
taataatggt 
tttgtttatt 
aaatgcttca 



ttctccgccc 
ctaggaacag 
accagatggg 
accgcggctt 
agttataccg 
gtgactggga 
ccagctggcg 
tgaatggcga 
tggagtgcga 
gttacgatgc 
ttcccacgga 
tacaggaagg 
gcaacgggcg 
gcgcattttt 
gcagttatct 
tgctgcataa 
atttcagccg 
tacgggtaac 
tcggcggtga 
acgtcgaaaa 
aactgcacac 
aggtgcggat 
ttaaccgtca 
aggatatcct 
accatccgct 
atattgaaac 
cggcgatgag 
tcatctggtc 
ggatcaaatc 
cggccaccga 
ctgtgccgaa 
tcctttgcga 
aggcgtttcg 
cgctgattaa 
atacgccgaa 
atccagcgct 
aaaccatcga 
ggatggtggc 
cacaaggtaa 
tctggctcac 
tcagcgcctg 
cccacgccat 
agcgttggca 
aacaactgct 
gcgtaagtga 
gccattacca 
tgctgattac 
aaacctaccg 
gcgatacacc 
gggtaaactg 
gttttgaccg 
aaaacggtct 
acttccagtt 
atctgctgca 
gtggcgacga 
accattacca 
aaggaacctt 
tctaaggtaa 
tgtgtatttt 
tccagacatg 
aaaatgcttt 
caataaacaa 
gtgggaggtt 
gcggccgcag 
ttcttagacg 
tttctaaata 
ataatattga 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 
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aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca 4260 
ttttgccttc ctgtttttgc tcacccagaa acgetggtga aagtaaaaga tgctgaagat 4320 
cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag 4380 
agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc 4440 
gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct 4500 
cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca 4560 
gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt 4620 
ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat 4680 
gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt 4740 
gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta 4800 
cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga 4 8 60 
ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt 4 920 
gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc 4980 
gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct 5040 
gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata 5100 
ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt 5160 
gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc 5220 
gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg 5280 
caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact 5340 
ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg 5400 
tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg 5460 
ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac 5520 
tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca 5580 
cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga 5640 
gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc 5700 
ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct 57 60 
gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg 5820 
agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct 5880 
tttgctcaca tgttctttec tgcgttatcc cctgattctg tggataaccg tattaccgcc 5940 
tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc 6000 
gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc cegcgcgttg gccgattcat 6060 
taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt 6120' 
aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt 6180 
atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta tgaccatgat 6240 
tacgccaagc tggcgcg 6257 



<210> 16 
<211> 6252 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
pRK64 (deltalnt) 

<400> 16 

cgtcatcacc gaaacgcgcg aggcagctgt ggaatgtgtg tcagttaggg tgtggaaagt 60 
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 120 
ggctccccag caggcagaag tgtgcaaagc atgcatctca attagtcagc aaccatagtc 180 
ccgcccctaa ctccgcccat cccgccccta , actccgccca gttccgccca ttctccgccc 240 
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctaggaacag 300 
tcgacgacac tgcagagacc tacttcacta acaaccggta cagttcgtgg accagatggg 360 
tgaggtggag tacgcgcccg gggagcccaa aggttacccc agttggggca ctactcccga 420 
aaaccgcttc tggatccata acttcgtata gcatacatta tacgaagtta taccgggcca 480 
ccatggtcgc gagtagcttg gcactggccg tcgttttaca acgtcgtgac tgggaaaacc 540 
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 600 
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc 660 
gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc 720 
ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca 780 
tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc 84 0 
cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga 900 
cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg 960 
tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg 1020 
ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag 1080 
atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga 1140 
ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg 1200 
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10 



15 



20 



15 



50 



40 



45 



50 



55 



60 
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tactggaggc 
ctttatggca 
tcgatgagcg 
aactgtggag 
acggcacgct 
atggtctgct 
atcatcctct 
tgaagcagaa 
acacgctgtg 
gcatggtgcc 
gcgtaacgcg 
ggaatgaatc 
atccttcccg 
tttgcccgat 
ccatcaaaaa 
cccacgcgat 
atccccgttt 
atgaaaacgg 
gccagttctg 
aagcaaaaca 
ccagcgaata 
atggtaagcc 
tgattgaact 
gcgtagtgca 
agtggcgtct 
atctgaccac 
accgccagtc 
cgctgcgcga 
cccgcattga 
aagcagcgtt 
ctcacgcgtg 
atggtagtgg 
cggcgcggat 
gattagggcc 
atctgccatt 
gcgggacgcg 
tcagccgcta 
aagaaggcac 
ggagcccgtc 
tctggtgtca 
tgtggtgtga 
aaatttttaa 
ccaacctatg 
atacattgat 
tgaaatttgt 
caacaacaat 
aagcaagtaa 
cgtgatacgc 
tggcactttt 
aaatatgtat 
gaagagtatg 
ccttcctgtt 
gggtgcacga 
tcgccccgaa 
attatcccgt 
tgacttggtt 
agaattatgc 
aacgatcgga 
tcgccttgat 
cacgatgcct 
tctagcttcc 
tctgcgctcg 
tgggtctcgc 
tatctacacg 
aggtgcctca 
gattgattta 
tctcatgacc 



tgaagttcag 
gggtgaaacg 
tggtggttat 
cgccgaaatc 
gattgaagca 
gctgctgaac 
gcatggtcag 
caactttaac 
cgaccgctac 
aatgaatcgt 
aatggtgcag 
aggccacggc 
cccggtgcag 
gtacgcgcgc 
atggctttcg 
gggtaacagt 
acagggcggc 
caacccgtgg 
tatgaacggt 
ccagcagcag 
cctgttccgt 
gctggcaagc 
gcctgaacta 
accgaacgcg 
ggcggaaaac 
cagcgaaatg 
aggctttctt 
tcagttcacc 
ccctaacgcc 
gttgcagtgc 
gcagcatcag 
tcaaatggcg 
tggcctgaac 
gcaagaaaac 
gtcagacatg 
cgaattgaat 
cagtcaacag 
atggctgaat 
agtatcggcg 
aaaataataa 
cataattgga 
gtgtataatg 
gaactgatga 
gagtttggac 
gatgctattg 
tgcattcatt 
aacctctaca 
ctatttttat 

cggggaaatg 

ccgctcatga 
agtattcaac 
tttgctcacc 
gtgggttaca 
gaacgttttc 
attgacgccg 
gagtactcac 
agtgctgcca 
ggaccgaagg 
cgttgggaac 
gtagcaatgg 
cggcaacaat 
gcccttccgg 
ggtatcattg 
acggggagtc 
ctgattaagc 
aaacttcatt 
aaaatccctt 



atgtgcggcg 
caggtcgcca 
gccgatcgcg 
ccgaatctct 
gaagcctgcg 
ggcaagccgt 
gtcatggatg 
gccgtgcgct 
ggcctgtatg 
ctgaccgatg 
cgcgatcgta 
gctaatcacg 
tatgaaggcg 
gtggatgaag 
ctacctggag 
cttggcggtt 
ttcgtctggg 
tcggcttacg 
ctggtctttg 
tttttccagt 
catagcgata 
ggtgaagtgc 
ccgcagccgg 
accgcatggt 
ctcagtgtga 
gatttttgca 
tcacagatgt 
cgtgcaccgc 
tgggtcgaac 
acggcagata 
gggaaaacct 
attaccgttg 
tgccagctgg 
tatcccgacc 
tataccccgt 
tatggcccac 
caactgatgg 
atcgacggtt 
gaattccagc 
taaccgggca 
caaactacct 
tgttaaacta 
atgggagcag 
aaaccacaac 
ctttatttgt 
ttatgtttca 
aatgtggtat 
aggttaatgt 
tgcgcggaac 
gacaataacc 
atttccgtgt 
cagaaacgct 
tcgaactgga 
caatgatgag 
ggcaagagca 
cagtcacaga 
taaccatgag 
agctaaccgc 
cggagctgaa 
caacaacgtt 
taatagactg 
ctggctggtt 
cagcactggg 
aggcaactat 
attggtaact 
tttaatttaa 
aacgtgagtt 



agttgcgtga 
gcggcaccgc 
tcacactacg 
atcgtgcggt 
atgtcggttt 
tgctgattcg 
agcagacgat 
gttcgcatta 
tggtggatga 
atccgcgctg 
atcacccgag 
acgcgctgta 
gcggagccga 
accagccctt 
agacgcgccc 
tcgctaaata 
actgggtgga 
gcggtgattt 
ccgaccgcac 
tccgtttatc 
acgagctcct 
ctctggatgt 
agagcgccgg 
cagaagccgg 
cgctccccgc 
tcgagctggg 
ggattggcga 
tggataacga 
gctggaaggc 
cacttgctga 
tatttatcag 
atgttgaagt 
cgcaggtagc 
gccttactgc 
acgtcttccc 
accagtggcg 
aaaccagcca 
tccatatggg 
tgagcgccgg 
ggggggatct 
acagagattt 
ctgattctaa 
tggtggaatg 
tagaatgcag 
aaccattata 
ggttcagggg 
ggctgattat 
catgataata 
ccctatttgt 
ctgataaatg 
cgcccttatt 
ggtgaaagta 
tctcaacagc 
cacttttaaa 
actcggtcgc 
aaagcatctt 
tgataacact 
ttttttgcac 
tgaagccata 
gcgcaaacta 
gatggaggcg 
tattgctgat 
gccagatggt 
ggatgaacga 
gtcagaccaa 
aaggatctag 
ttcgttccac 



ctacctacgg 
gcctttcggc 
tctgaacgtc 
ggttgaactg 
ccgcgaggtg 
aggcgttaac 
ggtgcaggat 
tccgaaccat 
agccaatatt 
gctaccggcg 
tgtgatcatc 
tcgctggatc 
caccacggcc 
cccggctgtg 
gctgatcctt 
ctggcaggcg 
tcagtcgctg 
tggcgatacg 
gccgcatcca 
cgggcaaacc 
gcactggatg 
cgctccacaa 
gcaactctgg 
gcacatcagc 
cgcgtcccac 
taataagcgt 
taaaaaacaa 
cattggcgta 
ggcgggccat 
tgcggtgctg 
ccggaaaacc 
ggcgagcgat 
agagcgggta 
cgcctgtttt 
gagcgaaaac 
cggcgacttc 
tcgccatctg 
gattggtggc 
tcgctaccat 
ttgtgaagga 
aaagctctaa 
ttgtttgtgt 
ccagatccag 
tgaaaaaaat 
agctgcaata 
gaggtgtggg 
gatctgcggc 
atggtttctt 
ttatttttct 
cttcaataat 
cccttttttg 
aaagatgctg 
ggtaagatcc 
gttctgctat 
cgcatacact 
acggatggca 
gcggccaact 
aacatggggg 
ccaaacgacg 
ttaactggcg 
gataaagttg 
aaatctggag 
aagccctccc 
aatagacaga 
gtttactcat 
gtgaagatcc 
tgagcgtcag 



gtaacagttt 
ggtgaaatta 
gaaaacccga 
cacaccgccg 
cggattgaaa 
cgtcacgagc 
atcctgctga 
ccgctgtggt 
gaaacccacg 
atgagcgaac 
tggtcgctgg 
aaatctgtcg 
accgatatta 
ccgaaatggt 
tgcgaatacg 
tttcgtcagt 
attaaatatg 
ccgaacgatc 
gcgctgacgg 
atcgaagtga 
gtggcgctgg 
ggtaaacagt 
ctcacagtac 
gcctggcagc 
gccatcccgc 
tggcaattta 
ctgctgacgc 
agtgaagcga 
taccaggccg 
attacgaccg 
taccggattg 
acaccgcatc 
aactggctcg 
gaccgctggg 
ggtctgcgct 
cagttcaaca 
ctgcacgcgg 
gacgactcct 
taccagttgg 
accttacttc 
ggtaaatata 
attttagatt 
acatgataag 
gctttatttg 
aacaagttaa 
aggtttttta 
cgcagggcct 
agacgtcagg 
aaatacattc 
attgaaaaag 
cggcattttg 
aagatcagtt 
ttgagagttt 
gtggcgcggt 
attctcagaa 
tgacagtaag 
tacttctgac 
atcatgtaac 
agcgtgacac 
aactacttac 
caggaccact 
ccggtgagcg 
gtatcgtagt 
tcgctgagat 
atatacttta 
tttttgataa 
accccgtaga 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 



56 



aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 5280 
aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 5340 
tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 5400 
gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 54 60 
cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 5520 
acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 5580 
cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag 5640 
cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 5700 
aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 5760 
gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 5820 
atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 5880 
teacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 5940 
gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 6000 
agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga ttcattaatg 6060 
cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg caattaatgt 6120 
gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt 6180 
gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc 6240 
caagctggcg eg 6252 



<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Seguence 

<220> 

<223> Description of Artificial Seguence: primer P64-1 
<400> 17 

tcagcaacca ggctccccag caggc 25 



<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Seguence 



<220> 

<223> Description of Artificial Sequence: primer 64-4 
<400> 18 

gacgacagta tcggcctcag gaagatc 27 

<210> 19 
<211> 840 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 80d 



<400> 19 

ggtaccgagc 

ccaggctccc 

tgtggaaagt 

tcagcaacca 

gcccattctc 

teggectagg 

cgtggaccag 

gggcactact 

agttataccg 

gtgactggga 

ccagctggcg 

gaatggcgaa 

ggagtgcgat 



tcggatcctc 
cagcaggcag 
ccccaggctc 
tagtcccgcc 
cgccccatgg 
aacagtcgac 
atgggtgagg 
cccgaaaacc 
ggccaccatg 
aaaccctggc 
taatagcgaa 
tggcgctttg 
cttcctgagg 



tagtaaegge 
aagtatgcaa 
cccagcaggc 
cctaactccg 
ctgactaatt 
gaeactgeag 
tggagtaege 
gcttctggat 
gtcgcgagta 
gttacccaac 
gaggcccgca 
cctggcttcc 
ccgatactgt 



cgccagtgtg 
ageatgeate 
agaagtatgc 
cccatcccgc 
ttttttattt 
agacctactt 
geceggggag 
ccataacttc 
gcttggcact 
ttaatcgect 
ccgatcgccc 
ggcaccagaa 
Cgtcaagccg 



ctggaattcg 
tcaattagtc 
aaagcatgea 
ccctaactcc 
atgeagagge 
cactaacaac 
cccaaaggtt 
gtatagcata 
ggggttgctt 
tgcagcacat 
ttcccaacag 
gcggtgccgg 
aattctgcag 



gcttcagcaa 
agcaaccagg 
tctcaattag 
gcccagttcc 
cgaggccgcc 
eggtacagtt 
accccagttg 
cattatacga 
ttgcgnygtc 
ccccctttcg 
ttgegcaget 
aaagctggct 
atatccatca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



57 



cactggcggc cgctcgagca tgcatctaga gggccaattc gccctatagt gagtcgtatt 840 

<210> 20 
5 <211> 1842 
<212> DNA 

<213> Bacteriophage phi-C31 

<220> 
10 <221> CDS 

<222> (1) . . (1839) 

<400> 20 

atg aca caa ggg gtt gtg acc ggg gtg gac acg tac gcg ggt get tac 48 

15 Met Thr Gin Gly Val Val Thr Gly Val Asp Thr Tyr Ala Gly Ala Tyr 
15 10 15 

gac cgt cag teg cgc gag cgc gag aat teg age gca gca age cca gcg 96 
; Asp Arg Gin Ser Arg Glu Arg Glu Asn Ser Ser Ala Ala Ser Pro Ala 
.0 ' ~ 20 " 25 30 

aca cag cgt age gec aac gaa gac aag gcg gec gac ctt cag cgc gaa 14 4 
Thr Gin Arg Ser Ala Asn Glu Asp Lys Ala Ala Asp Leu Gin Arg Glu 
35 40 45 

■5 

gtc gag cgc gac ggg ggc egg ttc agg ttc gtc ggg cat ttc age gaa 192 
Val Glu Arg Asp Gly Gly Arg Phe Arg Phe Val Gly His Phe Ser Glu 
50 55 60 

,0 gcg ccg ggc acg teg gcg ttc ggg acg gcg gag cgc ccg gag ttc gaa 240 

Ala Pro Gly Thr Ser Ala Phe Gly Thr Ala Glu Arg Pro Glu Phe Glu 
j 65 70 75 80 

cgc ate ctg aac gaa tgc cgc gec ggg egg etc aac atg ate att gtc 288 
5 Arg lie Leu Asn Glu Cys Arg Ala Gly Arg Leu Asn Met lie lie Val 

85 90 95 

\ tat gac gtg teg cgc ttc teg cgc ctg aag gtc atg gac gcg att ccg 336 
Tyr Asp Val Ser Arg Phe Ser Arg Leu Lys Val Met Asp Ala lie Pro 
40 100 105 * " 110 

att gtc teg gaa ttg etc gec ctg ggc gtg acg att gtt tec act cag 384 
lie Val Ser Glu Leu Leu Ala Leu Gly Val' Thr lie Val Ser Thr Gin 
115 120 125 



45 
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gaa ggc gtc ttc egg cag gga aac gtc atg gac ctg att cac ctg att 432 
Glu Gly Val Phe Arg Gin Gly Asn Val Met Asp Leu lie His Leu lie 
130 135 140 



50 atg egg etc gac gcg teg cac aaa gaa tct teg ctg aag teg "gcg aag 480 

Met Arg Leu Asp Ala Ser His Lys Glu Ser Ser Leu Lys Ser Ala Lys 

145 150 * 155 * 160 

att etc gac acg aag aac ctt cag cgc gaa ttg ggc ggg tac gtc ggc 528 

55 lie Leu Asp Thr Lys Asn Leu Gin Arg Glu Leu Gly Gly Tyr Val Gly 

165 170 175 

ggg aag gcg cct tac ggc ttc gag ctt gtt teg gag acg aag gag ate 576 

Gly Lys Ala Pro Tyr Gly Phe Glu Leu Val Ser Glu Thr Lys Glu He 

60 180 185 190 



acg cgc aac ggc cga atg gtc aat gtc gtc ate aac aag ctt gcg cac 624 
Thr Arg Asn Gly Arg Met Val Asn Val Val He Asn Lys Leu Ala His 
195 200 205 

teg acc act ccc ctt acc gga ccc ttc gag ttc gag ccc gac gta ate 672 
Ser Thr Thr Pro Leu Thr Gly Pro Phe Glu Phe Glu Pro Asp Val He 



58 



210 



215 



220 



egg tgg tgg tgg cgt gag ate aag acg cac aaa cac ctt ccc ttc aag 
Arg Trp Trp Trp Arg Glu lie Lys Thr His Lys His Leu Pro Phe Lys 

225 230 235 240 



720 



10 



ccg ggc agt caa gec gec att cac ccg ggc age ate acg ggg ctt tgt 768 
Pro Gly Ser Gin Ala Ala He His Pro Gly Ser He Thr Gly Leu Cys 
245 250 255 

aag cgc atg gac get gac gec gtg ccg acc egg ggc gag acg att ggg 816 
Lys Arg Met Asp Ala Asp Ala Val Pro Thr Arg Gly -Glu Thr He Gly 
260 265 ~ 270 



15 aag aag acc get tea age gec tgg gac ccg gca acc gtt atg cga ate 
Lys Lys Thr Ala Ser Ser Ala Trp Asp Pro Ala Thr Val Met Arg He 
275 280 285 



864 



ctt egg gac ccg cgt att gcg ggc ttc gec get gag gtg ate tac aag 
Leu Arg Asp Pro Arg He Ala Gly Phe Ala Ala Glu Val He Tyr Lys 
290 295 300 



912 



aag aag ccg gac ggc acg ccg acc acg aag att gag ggt tac cgc att 
Lys Lys Pro Asp Gly Thr Pro Thr Thr Lys He Glu Gly Tyr Arg He 
305 310 315 320 



960 



cag cgc gac ccg ate acq etc egg ccg gtc gag ctt gat tgc gga ccg 
Gin Arg Asp Pro He Thr Leu Arg Pro Val Glu Leu Asp Cys Gly Pro 

325 330 335 



1008 



.5 



ate ate gag ccc get gag tgg tat gag ctt cag gcg tgg ttg gac ggc 1056 

He He Glu Pro Ala Glu Trp Tyr Glu Leu Gin Ala Trp Leu Asp Gly 
340 * 345 350 

agg ggg cgc ggc aag ggg ctt tec egg ggg caa gee att ctg tec gec 1104 

Arg Gly Arg Gly Lys Gly Leu Ser Arg Gly Gin Ala He Leu Ser Ala 
355 360 365 



,' atg gac aag ctg tac tgc gag tgt ggc gec gtc atg act teg aag cgc 
10 Met Asp Lys Leu Tyr Cys Glu Cys Gly Ala Val Met Thr Ser Lys Arg 
370 375 * " 380 



1152 



45 



ggg gaa gaa teg ate aag gac tct tac cgc tgc cgt cgc egg aag gtg 
Gly Glu Glu Ser He Lys Asp Ser Tyr Arg Cys Arg Arg Arg Lys Val 
385 390 395 ' 400 



1200 



50 



55 



gtc gac ccg tec gca cct ggg cag cac gaa ggc acg tgc aac gtc age 1248 
Val Asp Pro Ser Ala Pro Gly Gin His Glu Gly Thr Cys Asn Val Ser 
405 410 415 

atg gcg gca etc gac aag ttc gtt gcg gaa cgc ate ttc aac aag ate 1296 
Met Ala Ala Leu Asp Lys Phe Val Ala Glu Arg He Phe Asn Lys He 
420 425 430 

agg cac gee gaa ggc gac gaa gag acg ttg gcg ctt ctg tgg gaa gee 134 4 
Arg His Ala Glu Gly Asp Glu Glu Thr Leu Ala Leu Leu Trp Glu Ala 
435 440 445 



gec cga cgc ttc ggc aag etc act gag gcg cct gag aag age ggc gaa 
60 Ala Arg Arg Phe Gly Lys Leu Thr Glu Ala Pro Glu Lys Ser Gly Glu 
450 455 460 



1392 



65 



egg gcg aac ctt gtt gcg gag cgc gec gac gec ctg aac gec ctt gaa 
Arg Ala Asn Leu Val Ala Glu Arg Ala Asp Ala Leu Asn Ala Leu Glu 
465 470 475 480 



1440 



gag ctg tac gaa gac cgc gcg gca ggc gcg tac gac gga ccc gtt ggc 1488 



59 

Glu Leu Tyr Glu Asp Arg Ala Ala Gly Ala Tyr Asp Gly Pro Val Gly 
485 490 * 495 

agg aag cac ttc egg aag caa cag gca gcg ctg acg etc egg cag caa 1536 
5 Arg Lys His Phe Arg Lys Gin Gin Ala Ala Leu Thr Leu Arg Gin Gin 
500 505 510 

ggg gcg gaa gag egg ctt gee gaa ctt gaa gee gee gaa gee ccg aag 1584 
Gly Ala Glu Glu Arg Leu Ala Glu Leu Glu Ala Ala Glu Ala Pro Lys 
10 515 520 525 

ctt ccc ctt gac caa tgg ttc ccc gaa gaegee gac -get gac ccg acc 1632 
Leu Pro Leu Asp Gin Trp Phe Pro Glu Asp Ala Asp Ala Asp Pro Thr 
530 535 540 

15 

ggc cct aag teg tgg tgg ggg cgc gcg tea gta gac gac aag cgc gtg 1680 
Gly Pro Lys Ser Trp Trp Gly Arg Ala Ser Val Asp Asp Lys Arg Val 
545 550 555 560 

10 ttc gtc ggg etc ttc gta gac aag ate gtt gtc acg aag teg act acg 1728 
Phe Val Gly Leu Phe Val Asp Lys lie Val Val Thr Lys Ser Thr Thr 
565 570 575 

ggc agg ggg cag gga acg ccc ate gag aag cgc get teg ate acg tgg 177 6 
15 Gly Arg Gly Gin Gly Thr Pro lie Glu Lys Arg Ala Ser lie Thr Trp 
580 585 590 

gcg aag ccg ccg acc gac gac gac gaa gac gac gee cag gac ggc acg 1824 
Ala Lys Pro Pro Thr Asp Asp Asp Glu Asp Asp Ala Gin Asp Gly Thr 
10 595 600 605 

gaa gac gta gcg gcg tag 1842 
Glu Asp Val Ala Ala 
610 

15 

i <210> 21 

<211> 613 
1 <212> PRT 
■+0 <213> Bacteriophage phi-C31 

<400> 21 ■ %,v 

Met Thr Gin Gly Val Val Thr Gly Val Asp Thr Tyr Ala Gly^Ala Tyr 
15 10 15 



45 



60 



Asp Arg Gin Ser Arg Glu Arg Glu Asn Ser Ser Ala Ala Ser. : . Pro Ala 
20 25 30 



Thr Gin Arg Ser Ala Asn Glu Asp Lys Ala Ala Asp . Leu Gin Arg Glu 
50 35 40 4 5 

Val Glu Arg Asp Gly Gly Arg Phe Arg Phe Val Gly His Phe Ser Glu 

50 55 60 

55 Ala Pro Gly Thr Ser Ala Phe Gly Thr Ala Glu Arg Pro Glu Phe Glu 

65 70 75 80 



Arg lie Leu Asn Glu Cys Arg Ala Gly Arg Leu Asn Met lie He Val 

85 90 95 

Tyr Asp Val Ser Arg Phe Ser Arg Leu Lys Val Met Asp Ala He Pro 
100 105 110 



He Val Ser Glu Leu Leu Ala Leu Gly Val Thr He Val Ser Thr Gin 
65 . 115 120 125 

Glu Gly Val Phe Arg Gin Gly Asn Val Met Asp Leu lie His Leu He 



60 



130 135 140 

Met Arg Leu Asp Ala Ser His Lys Glu Ser Ser Leu Lys Ser Ala Lys 
145 150 155 ' 160 

He Leu Asp Thr Lys Asn Leu Gin Arg Glu Leu Gly Gly Tyr Val Gly 
165 170 " ' 175 

Gly Lys Ala Pro Tyr Gly Phe Glu Leu Val Ser Glu Thr Lys Glu He 
180 185 190 

Thr Arg Asn Gly Arg Met Val Asn Val Val He Asn -Lys Leu Ala His 
195 200 205 

Ser Thr Thr Pro Leu Thr Gly Pro Phe Glu Phe Glu Pro Asp Val He 
210 215 220 

Arg Trp Trp Trp Arg Glu lie Lys Thr His Lys His Leu Pro Phe Lys 
225 230 235 240 

Pro Gly Ser Gin Ala Ala He His Pro Gly Ser He Thr Gly Leu Cys 
245 250 255 

Lys Arg Met Asp Ala Asp Ala Val Pro Thr Arg Gly Glu Thr He Gly 
260 265 270 

Lys Lys Thr Ala Ser Ser Ala Trp Asp Pro Ala Thr Val Met Arg lie 
275 280 285 

Leu Arg Asp Pro Arg lie Ala Gly Phe Ala Ala Glu Val He Tyr Lys 
290 295 * 300 

Lys Lys Pro Asp Gly Thr Pro Thr Thr Lys He Glu Gly Tyr Arg lie 
305 310 315 320 

Gin Arg Asp Pro He Thr Leu Arg Pro Val Glu Leu Asp Cys Gly Pro 
325 330 * 335 

lie lie Glu Pro Ala Glu Trp Tyr Glu Leu Gin Ala Trp Leu Asp Gly 
340 * 345 " 350 

Arg Gly Arg Gly Lys Gly Leu Ser Arg Gly Gin Ala He Leu Ser Ala 

355 360 -: 365 

Met Asp Lys Leu Tyr Cys Glu Cys Gly Ala Val Met Thr Ser Lys Arg 
370 375 380 

Gly Glu Glu Ser lie Lys Asp Ser Tyr Arg Cys Arg Arg Arg Lys Val 
385 390 395 " 400 

Val Asp Pro Ser Ala Pro Gly Gin His Glu Gly Thr Cys Asn Val Ser 
405 410 415 

Met Ala Ala Leu Asp Lys Phe Val Ala Glu Arg lie Phe Asn Lys lie 
420 425 430 

Arg His Ala Glu Gly Asp Glu Glu Thr Leu Ala Leu Leu Trp Glu Ala 
435 440 445 

Ala Arg Arg Phe Gly Lys Leu Thr Glu Ala Pro Glu Lys Ser Gly Glu 
450 455 460 

Arg Ala Asn Leu Val Ala Glu Arg Ala Asp Ala Leu Asn Ala Leu Glu 
465 470 475 480 

Glu Leu Tyr Glu Asp Arg Ala Ala Gly Ala Tyr Asp Gly Pro Val Gly 
485 4 90 ■ /i95 



10 



15 



45 



65 



61 



Arg Lys His Phe Arg Lys Gin Gin Ala Ala Leu Thr Leu Arg Gin Gin 

500 505 510 

Gly Ala Glu Glu Arg Leu Ala Glu Leu Glu Ala Ala Glu Ala Pro Lys 
515 520 525 

Leu Pro Leu Asp Gin Trp Phe Pro Glu Asp Ala Asp Ala Asp Pro Thr 
530 535 540 

Gly Pro Lys Ser Trp Trp Gly Arg Ala Ser Val Asp Asp Lys Arg Val 
545 " 550 " 555 • 560 



Phe Val Gly Leu Phe Val Asp Lys lie Val Val Thr Lys Ser Thr Thr 

15 565 570 575 

Gly Arg Gly Gin Gly Thr Pro lie Glu Lys Arg Ala Ser lie Thr Trp 
580 585 590 

.10 Ala Lys Pro Pro Thr Asp Asp Asp Glu Asp Asp Ala Gin Asp Gly Thr 

; 595 " 600 * 605 



Glu Asp Val Ala Ala 
610 



! <210> 22 

<211> 1863 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

I <223> Description of Artificial Sequence: DNA sequence 

35 coding for fusion protein C31-Int (CNLS) 

i 

<220> 

<221> CDS 

<222> (1) . . (1860) 

40 

<400> 22 

atg aca caa ggg gtt gtg acc ggg gtg gac'acg tac gcg ggt get tac 48 
Met Thr Gin Gly Val Val Thr Gly Val Asp Thr Tyr Ala Gly Ala Tyr 
1 5 10 15 

gac cgt cag teg cgc gag cgc gag aat teg age gca gca age cca gcg 96 
Asp Arg Gin Ser Arg Glu Arg Glu Asn Ser Ser Ala Ala Ser Pro Ala 
20 25 30 

50 aca cag cgt age gee aac gaa gac aag gcg gee gac ctt cag cgc gaa 14 4 
Thr Gin Arg Ser Ala Asn Glu Asp Lys Ala Ala Asp Leu Gin Arg Glu 
35 40 45 

gtc gag cgc gac ggg ggc egg ttc agg ttc gtc ggg cat ttc age gaa 192 
55 Val Glu Arg Asp Gly Gly Arg Phe Arg Phe Val Gly His Phe Ser Glu 
50 55 " 60 

gcg ccg ggc acg teg gcg ttc ggg acg gcg gag cgc ccg gag ttc gaa 240 
Ala Pro Gly Thr Ser Ala Phe Gly Thr Ala Glu Arg Pro Glu Phe Glu 
60 65 70 75 80 

cgc ate ctg aac gaa tgc cgc gee ggg egg etc aac atg ate att gtc 288 
Arg lie Leu Asn Glu Cys Arg Ala Gly Arg Leu Asn Met lie lie Val 
85 90 95 



tat gac gtg teg cgc ttc teg cgc ctg aag gtc atg gac gcg att ccg 336 
Tyr Asp Val Ser Arg Phe Ser Arg Leu Lys Val Met Asp Ala lie Pro 



'' 62 



100 105 HO 

att gtc teg gaa ttg etc gee ctg ggc gtg acg att gtt tec act cag 
lie Val Ser Glu Leu Leu Ala Leu Gly Val Thr lie Val Ser Thr Gin 
115 120 125 

gaa ggc gtc ttc egg cag gga aac gtc atg gac ctg att cac ctg att 
Glu Gly Val Phe Arg Gin Gly Asn Val Met Asp Leu lie His Leu lie 
130 135 140 

atg egg etc gac gcg teg cac aaa gaa tct teg ctg aag teg gcg aag 
Met Arg Leu Asp Ala Ser His Lys Glu Ser Ser Leu -Lys Ser Ala Lys 
145 150 155 160 

att etc gac acg aag aac ctt cag cgc gaa ttg ggc ggg tac gtc ggc 
lie Leu Asp Thr Lys Asn Leu Gin Arg Glu Leu Gly Gly Tyr Val Gly 
165 170 175 

ggg aag gcg cct tac ggc ttc gag ctt gtt teg gag acg aag gag ate 
Gly Lys Ala Pro Tyr Gly Phe Glu Leu Val Ser Glu Thr Lys Glu lie 
180 185 190 

acg cgc aac ggc cga atg gtc aat gtc gtc ate aac aag ctt gcg cac 
Thr Arg Asn Gly Arg Met Val Asn Val Val lie Asn Lys Leu Ala His 
195 200 205 

teg ace act ccc ctt acc gga ccc ttc gag ttc gag ccc gac gta ate 
Ser Thr Thr Pro Leu Thr Gly Pro Phe Glu Phe Glu Pro Asp Val lie 
210 215 220 

egg tgg tgg tgg cgt gag ate aag acg cac aaa cac ctt ccc ttc aag 
Arg Trp Trp Trp Arg Glu He Lys Thr His Lys His Leu Pro Phe Lys 
225 230 235 240 

ccg ggc agt caa gee gee att cac ccg ggc age ate acg ggg ctt tgt 
Pro Gly Ser Gin Ala Ala He His Pro Gly Ser He Thr Gly Leu Cys 
245 250 255 

aag cgc atg gac get gac gee gtg ccg acc egg ggc gag acg att ggg 
Lys Arg Met Asp Ala Asp Ala Val Pro Thr Arg Gly Glu Thr He Gly 
260 265 * 270 

aag aag acc get tea age gec tgg gac ccg gca acc gtt atg cga ate 
Lys Lys Thr Ala Ser Ser Ala Trp Asp Pro Ala Thr Val Met Arg He 
275 280 285 

ctt egg gac ccg cgt att gcg ggc ttc gec get gag gtg ate tac aag 
Leu Arg Asp Pro Arg He Ala Gly Phe Ala Ala Glu Val He Tyr Lys 
290 295 300 

aag aag ccg gac ggc acg ccg acc acg aag att gag ggt tac cgc att 
Lys Ly<; Pro Asp Gly Thr Pro Thr Thr Lys He Glu Gly Tyr Arg He 
305 310 315 320 

cag cgc gac ccg ate acg etc egg ccg gtc gag ctt gat tgc gga ccg 
Gin Arg Asp Pro He Thr Leu Arg Pro Val Glu Leu Asp Cys Gly Pro 
325 330 335 



ate ate gag ccc get gag tgg tat gag ctt cag gcg tgg ttg gac ggc 
He He Glu Pro Ala Glu Trp Tyr Glu Leu Gin Ala Trp Leu Asp Gly 
340 345 350 



agg ggg cgc ggc aag ggg ctt tec egg ggg caa gee att ctg tec gee 
Arg Gly Arg Gly Lys Gly Leu Ser Arg Gly Gin Ala He Leu Ser Ala 
355 360 365 

atg gac aag ctg tac tgc gag tgt ggc gec gtc atg act teg aag cgc 



63 



Met Asp Lys Leu Tyr Cys Glu Cys Gly Ala Val Met Thr Ser Lys Arg 
370 375 ~ 380 

ggg gaa gaa teg ate aag gac tct tac cgc tgc cgt cgc egg aag gtg 
Gly Glu Glu Ser lie Lys Asp Ser Tyr Arg Cys Arg Arg Arg Lys Val 
385 390 395 ' ' 400 

gtc gac ccg tec gca cct ggg cag cac gaa ggc acg tgc aac gtc age 
Val Asp Pro Ser Ala Pro Gly Gin His Glu Gly Thr Cys Asn Val Ser 
405 410 415 

atg gcg gca etc gac aag ttc gtt gcg gaa cgc ate- ttc aac aag ate 
Met Ala Ala Leu Asp Lys Phe Val Ala Glu Arg lie Phe Asn Lys lie 
420 425 " 430 

agg cac gec gaa ggc gac gaa gag acg ttg gcg ctt ctg tgg gaa gec 
Arg His Ala Glu Gly Asp Glu Glu Thr Leu Ala Leu Leu Trp Glu Ala 
435 440 445 

gec cga cgc ttc ggc aag etc act gag gcg cct gag aag age ggc gaa 
Ala Arg Arg Phe Gly Lys Leu Thr Glu Ala Pro Glu Lys Ser Gly Glu 
4 50 ' ~ 455 460 

egg gcg aac ctt gtt gcg gag cgc gec gac gec ctg aac gec ctt gaa 
Arg Ala Asn Leu Val Ala Glu Arg Ala Asp Ala Leu Asn Ala Leu Glu 
465 _ 470 475 480 

gag ctg tac gaa gac cgc gcg gca ggc gcg tac gac gga ccc gtt ggc 
Glu Leu Tyr Glu Asp Arg Ala Ala Gly Ala Tyr Asp Gly Pro Val Gly 
4 85 4 90 * - 495 " 

agg aag cac ttc egg aag caa cag gca gcg ctg acg etc egg cag caa 
Arg Lys His Phe Arg Lys Gin Gin Ala Ala Leu Thr Leu Arg Gin Gin 
500 505 510 

ggg gcg gaa gag egg ctt gec gaa ctt gaa gee gee gaa gec ccg aag 
Gly Ala Glu Glu Arg Leu Ala Glu Leu Glu Ala Ala Glu Ala Pro Lys 
515 520 525 

ctt ccc ctt gac caa tgg ttc ccc gaa gac gec gac get gac ccg acc 
Leu Pro Leu Asp Gin Trp Phe Pro /Glu Asp Ala Asp Ala Asp Pro Thr 

530 535 'f-r 540 

ggc cct aag teg tgg tgg ggg cgc gcg tea gta gac gac aag cgc gtg 
Gly Pro Lys Ser Trp Trp Gly Arg Ala Ser Val Asp Asp Lys Arg Val 
545 550 555 ' * * 560 

ttc gtc ggg etc ttc gta gac aag ate gtt gtc acg aag teg act acg 
Phe Val Gly Leu Phe Val Asp Lys lie Val Val Thr Lys Ser Thr Thr 
565 570 575 

ggc agg ggg cag gga acg ccc ate gag aag cgc get teg ate acg tgg 
Gly Arg Gly Gin Gly Thr Pro lie Glu Lys Arg Ala Ser lie Thr Trp 
580 .585 590 

gcg aag ccg ccg acc gac gac gac gaa gac gac gec cag gac ggc acg 
Ala Lys Pro Pro Thr Asp Asp Asp Glu Asp Asp Ala Gin Asp Gly Thr 
595 600 605 

gaa gac gta gcg gcg cct aag aag aag agg aag gtt tag 
Glu Asp Val Ala Ala Pro Lys Lys Lys Arg Lys Val 
610 615 " ' 620 



<210> 23 
<211> 620 
<212> PRT 



64 



<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for fusion protein C31-Int (CNLS ) 

5 <400> 23 

Met Thr Gin Gly Val Val. Thr Gly Val Asp Thr Tyr Ala Gly Ala Tyr 
15 10 15 

Asp Arg Gin Ser Arg Glu Arg Glu Asn Ser Ser Ala Ala Ser Pro Ala 
10 20 25 30 

Thr Gin Arg Ser Ala Asn Glu Asp Lys Ala Ala Asp. Leu Gin Arg Glu 
35 40 45 

15 Val Glu Arg Asp Gly Gly Arg Phe Arg Phe Val Gly His Phe Ser Glu 
50 " 55 60 



20 



i 



35 



50 



65 



Ala Pro Gly Thr Ser Ala Phe Gly Thr Ala Glu Arg Pro Glu Phe Glu 
65 70 75 80 

Arg lie Leu Asn Glu Cys Arg Ala Gly Arg Leu Asn Met lie lie Val 
85 90 95 



j Tyr Asp Val Ser Arg Phe Ser Arg Leu Lys Val Met Asp Ala lie Pro 

25 100 105 " 110 

I lie Val Ser Glu Leu Leu Ala Leu Gly Val Thr He Val Ser Thr Gin 

115 120 125 

30 Glu Gly Val Phe Arg Gin Gly Asn Val Met Asp Leu He His Leu He 

I 130 135 140 



Met Arg Leu Asp Ala Ser His Lys Glu Ser Ser Leu Lys Ser Ala Lys 

145 150 155 160' 

lie Leu Asp Thr Lys Asn Leu Gin Arg Glu Leu Gly Gly Tyr Val Gly 

165 170 175 



Gly Lys Ala Pro Tyr Gly Phe Glu Leu Val Ser Glu Thr Lys Glu He 
40 180 185 190 

Thr Arg Asn Gly Arg Met Val Asn Val Val He Asn Lys Leu Ala His 
195 200 205 

45 Ser Thr Thr Pro Leu Thr Gly Pro Phe Glu Phe Glu Pro Asp Val He 
210 215 220 



Arg Trp Trp Trp Arg Glu He Lys Thr His Lys His Leu Pro Phe Lys 
225 230 235 240 

Pro Gly Ser Gin Ala Ala He His Pro Gly Ser lie Thr Gly Leu Cys 
245 250 255 



Lys Arg Met Asp Ala Asp Ala Val Pro Thr Arg Gly Glu Thr lie Gly 
55 260 265 * 270 

Lys Lys Thr Ala Ser Ser Ala Trp Asp Pro Ala Thr Val Met Arg He 
275 280 285 

60 Leu Arg Asp Pro Arg He Ala Gly Phe Ala Ala Glu Val lie Tyr Lys 
290 295 300 



Lys Lys Pro Asp Gly Thr Pro Thr Thr Lys He Glu Gly Tyr Arg He 

305 310 315 320 

Gin Arg Asp Pro He Thr Leu Arg Pro Val Glu Leu Asp Cys Gly Pro 

325 330 335 



65 



$, ciitT/Mf ins CM! vj; ;1 | i pifji 



10 



He He Glu Pro Ala Glu Trp Tyr Glu Leu Gin Ala Trp Leu Asp Gly 

340 345 350 

Arg Gly Arg Gly Lys Gly Leu Ser Arg Gly Gin Ala He Leu Ser Ala 
355 360 365 

Met Asp Lys Leu Tyr Cys Glu Cys Gly Ala Val Met Thr Ser Lys Arq 
3^0 375 380 

Gly Glu Glu Ser He Lys Asp Ser Tyr Arg Cys Arg Arg Arg Lys Val 
385 390 395 - 400 

Val Asp Pro Ser Ala Pro Gly Gin His Glu Gly Thr Cys Asn Val Ser 
15 405 410 415 

Met Ala Ala Leu Asp Lys Phe Val Ala Glu Arg He Phe Asn Lys He 
420 425 " 430 

20 Arg His Ala Glu Gly Asp Glu Glu Thr Leu Ala Leu Leu Trp Glu Ala 
435 440 445 

Ala Arg Arg Phe Gly Lys Leu Thr Glu Ala Pro Glu Lys Ser Gly Glu 

25 450 455 460 

I Arg Ala Asn Leu Val Ala Glu Arg Ala Asp Ala Leu Asn Ala Leu Glu 

465 470 475 480 

Glu Leu Tyr Glu Asp Arg Ala Ala Gly Ala Tyr Asp Gly Pro Val Gly 
30 485 490 495 

Arg Lys His Phe Arg Lys Gin Gin Ala Ala Leu Thr Leu Arg Gin Gin 
500 505 510 

Gly Ala Glu Glu Arg Leu Ala Glu Leu Glu Ala Ala Glu Ala Pro Lys 
515 520 525 

Leu Pro Leu Asp Gin Trp Phe Pro Glu Asp Ala Asp Ala Asp Pro Thr 
530 535 ' 540 

Gly Pro Lys Ser Trp Trp Gly Arg Ala Ser Val Asp Asp Lys Arg Val 
545 ' 550 555 560 

Phe Val Gly Leu Phe Val Asp Lys He Val Val Thr Lys Ser Thr Thr 
565 570 575 

Gly Arg Gly Gin Gly Thr Pro He Glu Lys Arg Ala Ser He Thr Trp 
580 585 590 

Ala Lys Pro Pro Thr Asp Asp Asp Glu Asp Asp Ala Gin Asp Gly Thr 
595 600 605 

Glu Asp Val Ala Ala Pro Lys Lys Lys Arg Lys Val 
610 615 620 



35 



40 



45 



50 



55 



<210> 24 
<211> 43 
60 <212> PRT 

<213> Artificial Sequence 



65 



<220> 

<223> Description of Artificial Sequence: NLS 
<400> 24 

Met Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Cys Arg Leu Lys 



$0 



35 



40 



45 



55 



60 



66 ^ ' 

10 15 



Lys Leu Lys Cys Ser Lys Glu Lys Pro Lys Cys Ala Lys Cys Leu Lys 
20 25 30 

Lys Lys Lys Lys Arg Arg Arg Lys Thr Lys Arg 
35 40 

10 <210> 25 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
15 <220> 

<223> Description of Artificial Sequence: NLS 
<400> 25 

lie Lys Tyr Phe Lys Lys Phe Pro Lys Asp 

;0 1 5 10 

I 

I <210> 26 

<211> 14 

;5 <212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: NLS 
<400> 26 

Met Thr Gly Ser Lys Thr Arg Lys His Arg Gly Ser Gly Ala 
15 10 



<210> 27 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: NLS 
<400> 27 

Met Thr Gly Ser Lys His Arg Lys His Pro Gly Ser Gly Ala 
15 10 



<210> 28 
50 <211> 7 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: NLS 
<400> 28 

Gly Lys Lys Arg Ser Lys Ala 
1 5 



<210> 29 
<211> 14 
<212> PRT 
65. <213> Artificial Sequence 



<220> 



•H i '!: « ' t! n.iM !; il !i s : 'i 



67 



<223> Description of Artificial Sequence: NLS 

<400> 29 

Pro Lys Lys Ala Arg Glu Asp Val Ser Arg Lys Arg Pro Arg 
1 5 10 



<210> 30 
<211> 11 
10 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 

15 

<400> 30 

Ala Pro Lys Arg Lys Ser Gly Val Ser Lys Cys 
1 5 10 

10 

\ <210> 31 

; <2ii> 12 

: <212> PRT 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: NLS 

i 

' <400> 31 

30 Glu Glu Asp Gly Pro Gin Lys Lys Lys Arg Arg Leu 
! 1 5 10 



; <210> 32 

;35 <211> 8 
; <212> PRT 

* <213> Artificial Sequence 

<220> 

40 <223> Description of Artificial Sequence: NLS 
<4 00> 32 

Ala Pro Thr Lys Arg Lys Gly Ser 
1 5 

45 

<210> 33 
<211> 7 
<212> PRT 
50 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 

55 <400> 33 

Pro Asn Lys Lys Lys Arg Lys 
1 5 



60 <210> 34 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
65 <220> 

<223> Description of Artificial Sequence: NLS 



"Si u ' ■ , 



10 



55 



60 



65 



68 



<400> 34 

Lys Arg Pro Arg Pro 

1 ' 5 



<210> 35 
<211> 11 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: NLS 



<400> 35 

15 Cys Gly Gly Leu Ser Ser Lys Arg Pro Arg Pro 
1 5 10 



<210> 36 
<211> 19 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 
<400> 36 

Pro Pro Lys Lys Arg Met Arg Arg Arg lie Glu Pro Lys Lys Lys Lys 
1 5 ' 10 " 15 



Lys Arg Pro 



15 <210> 37 

j <2ii> n 

. <212> PRT 

1 <213> Artificial Sequence 
40 <220> 

<223> Description of Artificial Sequence: NLS 
<400> 37 

Pro Phe Leu Asp Arg Leu Arg Arg Asp Gin Lys 
45 1 5 10 



<210> 38 
<211> 9 
50 <212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: NLS 

<400> 38 

Pro Lys Gin Lys Arg Lys Met Ala Arg 
1 5 



<210> 39 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 



69 



<400> 39 

Ser Val Thr Lys Lys Arg Lys Leu Glu 
1 5 

5 

<210> 40 
<211> 11 
<212> PRT 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 

15 <400> 40 

Cys Gly Gly Ala Ala Lys Arg Val Lys Leu Asp 
1 5 10 



iO <210> 41 

<211> 9 
i <212> PRT 

I <213> Artificial Sequence 

15 <220> 

<223> Description of Artificial Sequence: NLS 
! <400> 41 

I Pro Ala Ala Lys Arg Val Lys Leu Asp 
!0 1 5 



i <210> 42 
i <211> 11 
'5 <212> PRT 

! <213> Artificial Sequence 
' <220> 

<223> Description of Artificial Sequence: NLS 

40 

<400> 42 

Arg Gin Arg Arg Asn Glu Leu Lys Arg Ser Pro 
1 5 10 

45 

<210> 43 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

50 

<220> 

<223> Description of Artificial Sequence: NLS 
<400> 43 

55 Pro Gin Ser Arg Lys Lys Leu Arg 
1 5 



<210> 44 
60 <211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

65 . <223> Description of Artificial Sequence: NLS 
<400> 44 



70 



Pro Leu Leu Lys Lys lie Lys Gin 
1 5 



5 <210> 45 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: NLS 
<400> 45 

Pro Gin Pro Lys Lys Lys Pro 
15 1 5 



<210> 46 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 
<400> 46 

Ser Lys Arg Val Ala Lys Arg Lys Leu 
1 5 



<210> 47 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 



S5 



<4 00> 4 7 

40 Ala Ser Lys Ser Arg Lys Arg Lys Leu 
1 5 



<210> 48 
45 <211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

50 <223> Description of Artificial Sequence: NLS 
<400> 48 

Gly Gly Leu Cys Ser Ala Arg Leu His Arg His Ala Leu Leu Ala Thr 
15 10 15 

55 

<210> 49 
<211> 8 

<212> PRT ; 
60 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 

65. <400> 49 

Arg Lys Thr Lys Lys Lys lie Lys 
1 5 



.,jl„ :«;!!„:«:„« !l-i£!' 



•71 



10 



15 



:0 



40 



<210> 50 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: NLS 
<400> 50 

Arg Lys Leu Lys Lys Leu Gly Asn 
1 5 



<210> 51 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: NLS 



<400> 51 

15 Arg Lys Asp Arg Arg Gly Gly Arg 

; 1 5 

' <210> 52 
JO <211> 18 

<212> PRT 
I <213> Artificial Sequence 

; <220> 

35 <223> Description of Artificial Sequence: NLS 
<400> 52 

i Asp Thr Arg Glu Lys Lys Lys Phe Leu Lys Arg Arg Leu Leu Arg Leu 

1 5 10 15 



Asp Glu 



45 <210> 53 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: NLS 
<400> 53 

Pro Lys Lys Lys Arg Lys Val 
55 1 5 



<210> 54 

<211> 1410 

60 <212> DNA 

<213> Bacteriophage R4 

<220> 
<221> CDS 
'65. <222> (1) . . (1407) 



<400> 54 



:P Kin { H-fv^n ,A <i j. no l 



10 



50 



I 



40 



50 



60 



65 



■ 72 . v . 

atg aat cga ggg ggg C cc act gta egg gec gac ate tac gtc cga ate 4 8 

Met Asn Arg Gly Gly Pro Thr Val Arg Ala Asp He Tyr Val Arg He 
1 5 10 is 

age ctg gac cgc aca ggg gaa gag etc ggg gtc gag cgc cag gag gag 96 
Ser Leu Asp Arg Thr Gly Glu Glu Leu Gly Val Glu Arg Gin Glu Glu 
20 25 30 

teg tgt cgc gag etc tgc aag age etc ggc atg gag gtg ggg cag gtg 144 
Ser Cys Arg Glu Leu Cys Lys Ser Leu Gly Met Glu Val Gly Gin Val 
35 40 45 



tgg gtc gac aac gac ctg age gec acc aag aag aac gtc gtc cgc cct 192 
Trp Val Asp Asn Asp Leu Ser Ala Thr Lys Lys Asn Val Val Arg Pro 
15 50 55 60 

gac ttc gag gcg atg ate gcg age aac ccg cag gcg ate gtc tgc tgg 240 
Asp Phe Glu Ala Met He Ala Ser Asn Pro Gin Ala He Val Cys Trp 
' 65 -70 75 80 

cac acc gac egg etc ate cgc gtc acg egg gac ctg gag egg gtg ate 288 
; Hxs Thr Asp Arg Leu He Arg Val Thr Arg Asp Leu Glu Arg Val He 

85 90 95 

i5 gac etc gga gtc aac gtc cac gee gtg atg gee gga cac ctg gac ctg 336 
j Asp Leu Gly Val Asn Val His Ala Val Met Ala Gly His Leu Asp Leu 
j 100 105 110 

tec acc ccg gee ggc cga gec gtc gee cgc acg gtg acg gec tgg gec 384 
Ser Thr Pro Ala Gly Arg Ala Val Ala Arg Thr Val Thr Ala Trp Ala 
115 120 125 

acg tac gag ggc gag cag aag get gag cgc cag aag etc gee aac ate 432 
Thr Tyr Glu Gly Glu Gin Lys Ala Glu Arg Gin Lys Leu Ala Asn lie 



35 130 135 



140 



eag aac gec cgc gee ggc aag ccg tac acc ccc ggc ate cgc ccc ttc 480 
Gin Asn Ala Arg Ala Gly Lys Pro Tyr Thr Pro Gly He Arg Pro Phe 
145 150 i 5 5 



160 



ggg tac ggc gac gac cac atg acc ate gtg acg gec- gag gcg gac gec 
Gly Tyr Gly Asp Asp His Met Thr He Val Thr Ala Glu Ala Asp Ala 
165 170 175 ' 



180 185 ~ 190 

gtg get cgc tac tgg gag gag etc aag etc cag teg ccc egg agt atg 

Val Ala Arg Tyr Trp Glu Glu Leu Lys Leu Gin Ser Pro Arg Ser Met 

I 95 200 205 



55 210 ' " 215 



220 



225 230 



235 ' 240 



528 



45 ate cgc gac ggc gcg aag atg ate etc gac ggc tgg tec ctg teg gec 576 
He Arg Asp Gly Ala Lys Met He Leu Asp Gly Trp Ser Leu Ser Ala 



624 



gec gca ggc ggc aag ggc tgg tct ctg egg ggc gta aag aag gtg ctg 672 
Ala Ala Gly Gly Lys Gly Trp Ser Leu Arg Gly Val Lys Lys Val Leu 



acc tec ccg cgc tac gtc ggg egg tec age tac etc ggg gag gtc gtg 720 
Thr Ser Pro Arg Tyr Val Gly Arg Ser Ser Tyr Leu Gly Glu Val Val 



768 



ggc gat get cag tgg ccg ccc ate etc gac ccg gac gtc tac tac ggg 

Gly Asp Ala Gin Trp Pro Pro He Leu Asp Pro Asp Val Tyr Tyr Gly 

245 250 255 

(T? ?^ a ^ c ctg aac aac ccc gac C 9 C ttc ggg gg c c =t cgg sie 

Val Val Ala He Leu Asn Asn Pro Asp Arg Phe Ser Gly Gly Pro Ara 

260 265 " 270 



73 



acc ggc cgc acc ccc ggc acg ctg etc gca ggc ate gec ttg tgc ggt 864 

Thr Gly Arg Thr Pro Gly Thr Leu Leu Ala Gly lie Ala Leu Cys Gly 

275 280 285 

5 

gag tgc ggc aag acg gtc agt gga cgc ggc tac cga ggt gtc ctg gtc 912 

Glu Cys Gly Lys Thr Val Ser Gly Arg Gly Tyr Arg Gly Val Leu Val 
290 295 300 

10 tac gga tgt aag gac acg cac act egg acg cct egg age ate get gac 960 

Tyr Gly Cys Lys Asp Thr His Thr Arg Thr Pro Arg Ser lie Ala Asp 

305 310 315 . 320 

ggc cgc gcg age age teg acc etc gec egg etc atg ttc ccc gac ttc 1008 

15 Gly Arg Ala Ser Ser Ser Thr Leu Ala Arg Leu Met Phe Pro Asp Phe 

325 330 335 

ctg ccc ggc etc ctg gee tct ggg cag gec gag gac ggc cag teg gca 1056 

Leu Pro Gly Leu Leu Ala Ser Gly Gin Ala Glu Asp Gly Gin Ser Ala 

3 340 345 350 

! 

gca tec aag cac teg gag gec cag acg ctg cgc gag cgc ctt gac ggg 1104 

j Ala Ser Lys His Ser Glu Ala Gin Thr Leu Arg Glu Arg Leu Asp Gly 

355 360 365 

'5 

ctg get acg gec tae gcg gag ggt gcg ate age ctg tct cag atg acg 1152 

■ Leu Ala Thr Ala Tyr Ala Glu Gly Ala lie Ser Leu Ser Gin Met Thr 
i 370 375 380 

i0 gee ggc teg gaa gca ctg egg aag aag ctg gag gtg ate gaa gec gac 1200 

■ Ala Gly Ser Glu Ala Leu Arg Lys Lys Leu Glu Val lie Glu Ala Asp 
. 385 390 395 400 

etc gtg ggc teg gca ggc ate ccg ccc ttc gat cca gtg gec gga gtg 1248 

35 Leu Val Gly Ser Ala Gly lie Pro Pro Phe Asp Pro Val Ala Gly Val 

| 405 410 415 

get ggc ctg ate tec ggc tgg ccc acc acg cct etc ccg acg cgt cga 1296 

Ala Gly Leu lie Ser Gly Trp Pro Thr Thr Pro Leu Pro Thr Arg Arg 

40 420 425 430 

gca tgg gtg gac ttc tgc ctg gtg gtc acg ctg aac acc cag aag ggg 1344 

Ala Trp Val Asp Phe Cys Leu Val Val Thr Leu Asn Thr Gin Lys Gly 

435 440 ■ 445 



45 



cgc cat gcg teg age atg acc gtg gac gac cac gtc acc ate gag tgg 1392 
Arg His Ala Ser Ser Met Thr Val Asp Asp His Val Thr lie Glu Trp 
450 455 460 



50 cga gac gtg gee gag tag 1410 
Arg Asp Val Ala Glu 
4 65 

55 <210> 55 
<211> 469 
<212> PRT 

<213> Bacteriophage R4 
60 <400> 55 

Met Asn Arg Gly Gly Pro Thr Val Arg Ala Asp lie Tyr Val Arg lie 
15 10 15 

Ser Leu Asp Arg Thr Gly Glu Glu Leu Gly Val Glu Arg Gin Glu Glu 
65 . 20 25 30 



Ser Cys Arg Glu Leu Cys Lys Ser Leu Gly Met Glu Val Gly Gin Val 



'11 'j H !• 1 M i 11- !"!'!! „. l! II I! Si ' !! " 1 



74 



• 



35 



40 



45 



Trp Val Asp Asn Asp Leu Ser Ala Thr Lys Lys Asn Val Val Arg Pro 

50 55 ' 60 



Asp Phe Glu Ala Met lie Ala Ser Asn Pro Gin Ala lie Val Cys Trp 
65 70 75 80 



His Thr Asp Arg Leu He Arg 
10 85 



Val Thr Arg Asp Leu Glu Arg Val He 
90 95 



Asp Leu Gly Val Asn Val His 
100 



15 Ser Thr 



Pro Ala Gly Arg Ala 
115 



Ala Val Met Ala Gly. His Leu Asp Leu 
105 110 

Val Ala Arg Thr Val Thr Ala Trp Ala 
120 125 



Thr Tyr Glu Gly Glu Gin Lys 
. 130 135 



Ala Glu Arg Gin Lys Leu Ala Asn He 
14 0 



Gin Asn Ala Arg Ala Gly Lys 
145 150 



Pro Tyr Thr Pro Gly He Arg Pro Phe 
155 160 



■ Gly Tyr Gly Asp Asp His Met 
!5 165 

' He Arg Asp Gly Ala Lys Met 
180 

30 Val Ala Arg Tyr Trp Glu Glu 

i 195 



Thr He Val Thr Ala Glu Ala Asp Ala 

170 175 

He Leu Asp Gly Trp Ser Leu Ser Ala 
185 190 

Leu Lys Leu Gin Ser Pro Arg Ser Met 
200 205 



35 



I i. 



; 40 



Ala Ala Gly Gly Lys Gly Trp 
210 215 

Thr Ser Pro Arg Tyr Val Gly 
225 230 

Gly Asp Ala Gin Trp Pro Pro 
245 



Ser Leu Arg Gly Val Lys Lys Val Leu 
220 

Arg Ser Ser Tyr Leu Gly Glu Val Val 

235 240 

He Leu Asp Pro Asp Val Tyr Tyr Gly 

250 * " 255 



50 



55 



Val Val Ala He Leu Asn Asn 
260 



45 Thr Gly 



Glu Cys 
290 

Tyr Gly 
305 



Arg Thr Pro Gly Thr 

275 

Gly Lys Thr Val Ser 
295 

Cys Lys Asp Thr His 
310 



Gly Arg Ala Ser Ser Ser Thr 
325 



Pro Asp Arg Phe Ser Gly Gly Pro Arg 
265 270 

Leu Leu Ala Gly He Ala Leu Cys Gly 
280 285 

Gly Arg Gly Tyr Arg Gly Val Leu Val 
300 

Thr Arg Thr Pro Arg Ser He Ala Asp 

315 320 

Leu Ala Arg Leu Met Phe Pro Asp Phe 
330 335 



65 



Leu Pro Gly Leu Leu Ala Ser 
340 



60 Ala Ser 



Leu Ala 
370 

Ala Gly 
385 



Lys His Ser Glu Ala 
355 

Thr Ala Tyr Ala Glu 
375 

Ser Glu Ala Leu Arg 
390 



Gly Gin Ala Glu Asp Gly Gin Ser Ala 
345 350 

Gin Thr Leu Arg Glu Arg Leu Asp Gly 
360 365 

Gly Ala He Ser Leu Ser Gin Met Thr 
380 

Lys Lys Leu Glu Val He Glu Ala Asp 
395 400 



75 



10 



Leu Val Gly Ser Ala Gly lie Pro Pro Phe Asp Pro Val Ala Gly Val 
405 410 " 415 

Ala Gly Leu lie Ser Gly Trp Pro Thr Thr Pro Leu Pro Thr Arg Arg 
420 425 430 

Ala Trp Val Asp Phe Cys Leu Val Val Thr Leu Asn Thr Gin Lys Gly 
4 35 4 40 445 

Arg His Ala Ser Ser Met Thr Val Asp Asp His Val Thr He Glu Trp 
450 455 * 460- 



Arg Asp Val Ala Glu 
15 465 



<210> 56 
70 <211> 1503 
! <212> DNA 

<213> CisA recombinase 

<220> 
15 <221> CDS 
i <222> (1) . . (1500) 

<400> 56 

gtg ata gca ata tat gta agg gta teg ace gag gaa caa gcg ate aag 48 

SO Val He Ala He Tyr Val Arg Val Ser Thr Glu Glu Gin Ala He Lys 
11 5 10 15 

j gga teg age ate gac age caa ate gag gec tgt ata aag aaa gca ggg 96 

Gly Ser Ser lie Asp Ser Gin He Glu Ala Cys lie Lys Lys Ala Gly 
55 20 25 30 

i 

! act aaa gat gtg ctg aag tat gca gat gaa gga ttt tea gga gag ctt 144 
j Thr Lys Asp Val Leu Lys Tyr Ala Asp Glu Gly Phe Ser Gly Glu Leu 
! 35 40 * 4 5 

iO 

tta gaa cgt ccg get ttg aat cgc ttg agg gag gat gca age aag gga 192 
Leu Glu Arg Pro Ala Leu Asn Arg Leu Arg Glu Asp Ala Ser Lys Gly 
50 " 55 60 

45 ctt ata agt caa gtc att tgt tac gat.cct gac cgt ctt tct egg aaa 240 
Leu He Ser Gin Val He Cys Tyr Asp Pro Asp Arg Leu Ser Arg Lys 
65 70 75 80 

tta atg aat cag eta ate att gat gac gaa ttg cga aag cga aac ata 288 
50 Leu Met Asn Gin Leu lie lie Asp Asp Glu Leu Arg Lys Arg Asn He 
.85 90 95 

cct ttg att ttt gta aat ggt gaa tac gec aat tct cca gaa ggt caa 336 
Pro Leu He Phe Val Asn Gly Glu Tyr Ala Asn Ser Pro Glu Gly Gin 
55 100 105 110 

ttg ttt ttc gca atg cgc ggg gca ate tea gaa ttt gaa aaa gec aaa 384 
Leu Phe Phe Ala Met Arg Gly Ala He Ser Glu Phe Glu Lys Ala Lys 
115 120 125 

60 

ate aaa gaa egg aca tea age ggc cga ctt caa aaa atg aaa aaa ggc 4 32 
He Lys Glu Arg Thr Ser Ser Gly Arg Leu Gin Lys Met Lys Lys Gly 
130 135 " 140 

65 . atg ate att aaa gat tct aaa eta tat ggc tat aaa ttt gtt aaa gag 480 
Met He He Lys Asp Ser Lys Leu Tyr Gly Tyr Lys Phe Val Lys Glu 
145 150 " 155 * 160 



76 



aaa aga act ctt gag ata tta gaa gag gaa gca aaa ate att egg atg 528 

Lys Arg Thr Leu Glu lie Leu Glu Glu Glu Ala Lys lie lie Arg Met 

165 170 175 

5 

att ttt aac tat ttc acc gat cat aaa age cct ttt ttc ggc aga gta 576 

lie Phe Asn Tyr Phe Thr Asp His Lys Ser Pro Phe Phe Gly Arg Val 

180 185 190 

10 aat ggt att get eta cat tta act cag atg ggg gtt aaa aca aaa aaa 624 
Asn Gly lie Ala Leu His Leu Thr Gin Met Gly Val Lys Thr Lys Lys 
195 200 .205 

ggc gee aaa gta tgg cac agg cag gtt gtt egg caa ata tta atg aac 672 
15 Gly Ala Lys Val Trp His Arg Gin Val Val Arg Gin lie Leu Met Asn 
210 215 220 

tct tec tat aag ggt gaa cat aga cag tat aaa tat gat aca gag ggt 720 
Ser Ser Tyr Lys Gly Glu His Arg Gin Tyr Lys Tyr Asp Thr Glu Gly 
20 225 230 235 240 

tec tat gtt tea aag cag gca ggg aac aaa tct ata att aaa ata agg 768 
Ser Tyr Val Ser Lys Gin Ala Gly Asn Lys Ser lie lie Lys lie Arg 
245 250 255 



25 



45 



65 



cct gaa gaa gaa caa ate act gtg aca att cca gca att gtt cca get 816 
Pro Glu Glu Glu Gin He Thr Val Thr He Pro Ala He Val Pro Ala 
260 265 270 



30 gaa caa tgg gat tat get caa gaa etc tta ggt caa agt aaa aga aaa 8 64 
; Glu Gin Trp Asp Tyr Ala Gin Glu Leu Leu Gly Gin Ser Lys Arg Lys 

j 275 280 285 

i 

cac ttg agt ate age cct cac aat tac ttg tta teg ggt ttg gtt aga 912 
35 His Leu Ser He Ser Pro His Asn Tyr Leu Leu Ser Gly Leu Val Arg 
290 295 " 300 

tgc gga aaa tgc gga aat acc atg aca ggg aag aaa aga aaa tea cat 960 
Cys Gly Lys Cys Gly Asn Thr Met Thr Gly Lys Lys Arg Lys Ser His 
40 305 310 315 320 

ggt aaa gac tac tat gta tat act tgc egg aaa aat tat tct ggc gca 1008 
Gly Lys Asp Tyr Tyr Val Tyr Thr Cys Arg Lys Asn Tyr Ser Gly Ala 
325 330 335 



aag gac cgc ggc tgc gga aaa gaa atg tct gag aat aaa ttg aac egg 1056 
Lys Asp Arg Gly Cys Gly Lys Glu Met Ser Glu Asn Lys Leu Asn Arg 
340 345 350 



50 cat gta tgg ggt gaa att ttt aaa ttc ate aca aat cct caa aag tat 1104 
His Val Trp Gly Glu He Phe Lys Phe He Thr Asn Pro Gin Lys Tyr 
355 360 365 

gtt tct ttt aaa gag get gaa caa tea aat cac ctg tct gat gaa tta 1152 
55 Val Ser Phe Lys Glu Ala Glu Gin Ser Asn His Leu Ser Asp Glu Leu 
370 375 380 

gaa ctt att gaa aaa gag ata gag aaa aca aaa aaa ggc cgc aag cgt 1200 
Glu Leu He Glu Lys Glu He Glu Lys Thr Lys Lys Gly Arg Lys Arg 
60 385 390 395 ' ' 400 

ctt tta acg eta ate age eta age gat gac gat gat tta gac ata gat 1248 
Leu Leu Thr Leu He Ser Leu Ser Asp Asp Asp Asp Leu Asp He Asp 
405 410 415 



gaa ate aaa gca caa att att gaa ctg caa aaa aag caa aat cag ctt 1296 
Glu He Lys Ala Gin He He Glu Leu Gin Lys Lys Gin Asn Gin Leu 



10 



I 

35 



50 



65 



77 



420 425 430 

act gaa aag tgt aac aga ate cag tea aaa atg aaa gtc eta gat gat 1344 
Thr Glu Lys Cys Asn Arg lie Gin Ser Lys Met Lys Val Leu Asp Asp 
435 ' 440 445 

acg age tea agt gaa aat get eta aaa aga gee ate gac tat ttt caa 1392 
Thr Ser Ser Ser Glu Asn Ala Leu Lys Arg Ala lie Asp Tyr Phe Gin 
450 455 460 

tea ate ggt gca gat aac tta act ctt gaa gat aaa aaa aca att gtt " 1440 
Ser lie Gly Ala Asp Asn Leu Thr Leu Glu Asp Lys. Lys Thr lie Val 
465 ' 470 475 480 



15 aac ttt ate gtg aaa gaa gtt acc att gtg gat tct gac acc ata tat 1488 
Asn Phe lie Val Lys Glu Val Thr lie Val Asp Ser Asp Thr lie Tyr . 

485 490 495 

att gaa acg tat taa 1503 
1 20 He Glu Thr Tyr 
500 

<210> 57 
25 <211> 500 
! <212> PRT 

! <213> CisA recombinase 

I 

'; <400> 57 

30 Val He Ala He Tyr Val Arg Val Ser Thr Glu Glu Gin Ala He Lys 
15 10 15 



Gly Ser Ser He Asp Ser Gin He Glu Ala Cys He Lys Lys Ala Gly 
20 25 ' : . 30 

Thr Lys Asp Val Leu Lys Tyr Ala Asp Glu Gly Phe Ser Gly Glu Leu 
35 40 45 



Leu Glu Arg Pro Ala Leu Asn Arg Leu Arg Glu Asp Ala Ser Lys Gly 
40 50 55 60 

Leu lie Ser Gin Val He Cys Tyr Asp Pro Asp Arg Leu Ser Arg Lys 
65 70 75 80 

45 Leu Met Asn Gin Leu He He Asp Asp Glu Leu Arg Lys Arg Asn He 

85 90 95 



Pro Leu He Phe Val Asn Gly Glu Tyr Ala Asn Ser Pro Glu Gly Gin 
100 105 110 

Leu Phe Phe Ala Met Arg Gly Ala lie Ser Glu Phe Glu Lys Ala Lys 
115 120 125 



He Lys Glu Arg Thr Ser Ser Gly Arg Leu Gin Lys Met Lys Lys Gly 
55 130 135 140 

Met He He Lys Asp Ser Lys Leu Tyr Gly Tyr Lys Phe Val Lys Glu 
145 150 155 * . 160 

60 Lys Arg Thr Leu Glu He Leu Glu Glu Glu Ala Lys lie lie Arg Met 

165 170 175 



He Phe Asn Tyr Phe Thr Asp His Lys Ser Pro Phe Phe Gly Arg Val 

180 185 190 

Asn Gly He Ala Leu His Leu Thr Gin Met Gly Val Lys Thr Lys Lys 

195 200 205 




78 



Gly Ala Lys Val Trp His Arg Gin Val Val Arg Gin lie Leu Met Asn 

210 215 220 

Ser Ser Tyr Lys Gly Glu His Arg Gin Tyr Lys Tyr Asp Thr Glu Gly 

225 230 235 ' ' 240 

Ser Tyr Val Ser Lys Gin Ala Gly Asn Lys Ser lie lie Lys lie Arg 

245 250 " 255 

Pro Glu Glu Glu Gin He Thr Val Thr He Pro Ala He Val Pro Ala 

260 265 . 270 

Glu Gin Trp Asp Tyr Ala Gin Glu Leu Leu Gly Gin Ser Lys Arg Lys 

275 * 280 " 285 

His Leu Ser He Ser Pro His Asn Tyr Leu Leu Ser Gly Leu Val Arg 

290 295 300 

Cys Gly Lys CysGly Asn Thr Met Thr Gly Lys Lys Arg Lys Ser His 

305 310 315 " 320 

Gly Lys Asp Tyr Tyr Val Tyr Thr Cys Arg Lys Asn Tyr Ser Gly Ala 

325 330 ' 335 

Lys Asp Arg Gly Cys Gly Lys Glu Met Ser Glu Asn Lys Leu Asn Arg 

340 345 " 350 

His Val Trp Gly Glu He Phe Lys Phe He Thr Asn Pro Gin Lys Tyr 

355 360 365 

Val Ser Phe Lys Glu Ala Glu Gin Ser Asn His Leu Ser Asp Glu Leu 

370 375 380 

Glu Leu He Glu Lys Glu He Glu Lys Thr Lys Lys Gly Arg Lys Arg 

385 390 395 ~ " 400 

Leu Leu Thr Leu He Ser Leu Ser Asp Asp Asp Asp Leu Asp He Asp 

405 410 ' * 415 

Glu He Lys Ala Gin lie lie Glu Leu Gin Lys Lys Gin Asn Gin Leu 

420 425 ' 430 

Thr Glu Lys Cys Asn Arg lie Gin Ser Lys Met Lys Val Leu Asp Asp 

435 ' 440 445 

Thr Ser Ser Ser Glu Asn Ala Leu Lys Arg Ala lie Asp Tyr Phe Gin 

450 455 " 460 

Ser He Gly Ala Asp Asn Leu Thr Leu Glu Asp Lys Lys Thr lie Val 

465 470 475 " ' 480 

Asn Phe lie Val Lys Glu Val Thr lie Val Asp Ser Asp Thr lie Tyr . 

4 85 4 90 ' 4 95 

He Glu Thr Tyr 



500 



<210> 58 
<211> 1545 
<212> DNA 

<213> XisF recombinase 



<220> 
<221> CDS 




<222> (1) . . (1542) 

<400> 58 

atg gaa aat tgg ggt tac gcg aga gtg age ggt gag gaa cag caa aca 48 
5 Met Glu Asn Trp Gly Tyr Ala Arg Val Ser Gly Glu Glu Gin Gin Thr 
1 5 10 15 

gat aaa ggt gcg ttg cgt aaa caa ata gaa cgc ttg cgt aat get gga 96 
Asp Lys Gly Ala Leu Arg Lys Gin lie Glu Arg Leu Arg Asn Ala Gly 
10 - - 2Q 25 30 

tgt tea aaa gtg tac tgg gat att caa teg egg aca. act gaa gtc aga 14 4 

Cys Ser Lys Val Tyr Trp Asp lie Gin Ser Arg Thr Thr Glu Val Arg 

35 40 ' 45 

15 

gaa ggg eta caa caa tta att aat gac tta aag aca tct tea aca ggt 192 

Glu Gly Leu Gin Gin Leu lie Asn Asp Leu Lys Thr Ser Ser Thr Gly 

50 55 60 

aag gta aaa tea ctg caa ttt acc cgc att gat cgc ate ggc tea tea 240 
Lys Val Lys Ser Leu Gin Phe Thr Arg lie Asp Arg lie Gly Ser Ser 
65 70 75 80 

i 

i 

■ teg egg ttg ttt tat tea ttg tta gag gta tta cgt tec aag gga att 288 
!5 Ser Arg Leu Phe Tyr Ser Leu Leu Glu Val Leu Arg Ser Lys Gly lie 
i 85 90 95 

; aaa ctg ata gee tta gat caa ggc gtt gac cca gac age ctt ggc ggg 33 6 
Lys Leu lie Ala Leu Asp Gin Gly Val Asp Pro Asp Ser Leu Gly Gly 

i0 ioo ' 105 " ' no 

' gaa eta aca att gat atg tta ctg get get gec aaa ttt gag gta aga 384 

Glu Leu Thr lie Asp Met Leu Leu Ala Ala Ala Lys Phe Glu Val Arg 
115 120 125 



35 



55 



atg gtg acg gag agg tta aaa age gaa cgt cgt cat agg gtg aac caa 432 
Met Val Thr Glu Arg Leu Lys Ser Glu Arg Arg His Arg Val Asn Gin 
130 135 " 140 



40 gga aaa agt cac cga gtt gee cca tta gga tac cgc aaa gat aaa gat 480 
Gly Lys Ser His Arg Val Ala Pro Leu Gly Tyr Arg Lys Asp Lys Asp 
145 150 155 • ' 160 

aaa tat ata cgc gat cgc tea cca tgt gtt tgc tta eta gaa gga cgc 528 
45 Lys Tyr lie Arg Asp Arg Ser Pro Cys Val Cys Leu Leu Glu Gly Arg 

165 " 170 " 175 

aga gaa tta acg gtg tct gac tta gec cag tat att ttt cac act ttt 576 
Arg Glu Leu Thr Val Ser Asp Leu Ala Gin Tyr lie Phe His Thr Phe 
50 180 185 190 

ttt gag tgc ggt tec gtt get get act gtg cgt aag ctg cac tea gat 624 
Phe Glu Cys Gly Ser Val Ala Ala Thr Val Arg Lys Leu His Ser Asp 
195 200 205 



ttt ggt ata gaa aca aaa gtt ctg aat tgg aac aag eta gaa aaa tct 672 
Phe Gly lie Glu Thr Lys Val Leu Asn Trp Asn Lys Leu Glu Lys Ser 
210 215 220 



60 tec egg att gtt ggc gac gac gac tta gat aaa att gca ttt aca cca 720 
Ser Arg lie Val Gly Asp Asp Asp Leu Asp Lys lie Ala Phe Thr Pro 
225 230 ' 235 240 

aat aaa act aac cac ccc ttg cgt tat ccc tgg tct ggg eta aga tgg 768 
65 . Asn Lys Thr Asn His Pro Leu Arg Tyr Pro Trp Ser Gly Leu Arg Trp 

245 250 * " 255 






tea 
Ser 


ate 
He 


cct 
Pro 


ggt 
Gly 
260 


tta 
Leu 


aaa 
Lys 


gcg 
Ala 


tta 
Leu 


tta 
Leu 

2 65 


gtt 

Val 


aac 
Asn 


cct 
Pro 


gtt 

Val 


tac 
Tyr 

270 


gee 
Ala 


ggg 

Gly 


816 


5 


ggt 
Lily 


ttg 
Leu 


ccc 
Pro 
275 


ttt 
Phe 


gat 
Asp 


act 
Thr 


tac 
Tyr 


gtt 
Val 
280 


aaa 
Lys 


tea 
Ser 


aaa 
Lys 


gga 
bly 


aaa 
Lys 

285 


cgc 
Arg 


aag 
Lys 


cat 
His 


364 


1 0 


ttt 

Phe 


gac 
Asp 
290 


gag 

blU 


tgg 
Trp 


aaa 
Lys 


gta 
Va 1 


aaa 
Lys 
295 


tgg 
Trp 


gga 
Gly 


ace 
Thr 


cac 

rilS 


gac 
Asp 
300 


gat 
Asp 


gag 

CjXU 


gca 
A±a 


ate 

Tl Q 

ixe 


912 


15 


att 
lie 
305 


acc 
Thr 


tgt 
Cys 


gag 

GlU 


gaa 
Glu 


cat 
His 
310 


gaa 
Glu 


aga 
Arg 


ata aaa 
He Lys 


cag 
bin 
315 


atg 
Met 


att 
ixe 


cga 
Arg 


gac 
Asp 


aat 
Asn 
320 


960 


t 

0 


cgc 
Arg 


aat 
Asn 


aat 
Asn 


cga 
Arg 


tgg 
Trp 
325 


get 
Ala 


gca 
AXa 


aga 
Arg 


gaa 
Glu 


gaa 
Glu 
330 


aac 
Asn 


gaa 
pi„ 

(jXU 


gta 
vax 


aac 
Asn 


cca 
Pro 
335 


ttt 
Phe 


1008 




tct 
Ser 


aat 
Asn 


tta 
Leu 


ctt 
Leu 
340 


aaa 
Lys 


tgt 
Cys 


acc 
1 nr 


cat 
His 


tgc 
Cys 
345 


ggc 
Gly 


ggc 

pi,. 
bxy 


tea 
Ser 


atg 
Met 


aca 
i nr 

350 


cgc 
Arg 


cac 

HIS 


1056 


5 

| 


gec 
ax a 


aaa 
Lys 


cgt 
Arg 
355 


gta 
Val 


gat 
Asp 


aag 
Lys 


agt 
ber 


gga caa get 
Gly Gin Ala 
360 


ate 

Ti- 
ne 


tat 
Tyr 


tat 
Tyr 
365 


tat 
Tyr 


cag 
bin 


tgc 
Cys 


1104 


0 

j 


cga 
Arg 


ttg 
Leu 
370 


tat 
Tyr 


aaa 
Lys 


get 
Ala 


ggc 
Gly 


aac 
Asn 
375 


tgt 
Cys 


age 
Ser 


aat 
Asn 


aaa 
Lys 


aat 
Asn 
380 


atg 
Met 


att 
He 


tea 
Ser 


tec 
Ser 


1152 


5 


aaa 
Lys 
385 


ata 
lie 


tta 
Leu 


gat 
Asp 


ate 
He 


caa 
Gin 
390 


gta 
Val 


atg 
Met 


gat 
Asp 


tta 
Leu 


ttg 
Leu 

395 


gca 
Ala 


caa 
Gin 


gaa 
Gi u 


gec 
Ala 


gaa 
Glu 
400 


1200 


i 

40 


cgt 
Arg 


tta 
Leu 


gca 
Ala 


aat 
Asn 


ttg 
Leu 
405 


gtg 
Val 


gaa 
Glu 


aca 
Thr 


gat 
Asp 


gag 
Glu 
410 


ccg 
Pro 


ctt 
Leu 


att 
He 


gta 
Val 


gaa 
Glu 
415 


gaa 
Glu 


1248 




ccc 
Pro 


cca 
Pro 


gaa 
Glu 


gta 
Val 
420 


aaa 
Lys 


acg 
Thr 


ctg 
Leu 


cgc gca 
Arg Ala 
425 


tec 
Ser 


ctg 
Leu 


aat 
Asn 


agt 
Ser 


ctg 
Leu 
430 


gaa 
Glu 


aca 
Thr 


1296 


45 


ttg 
Leu 


cca 
Pro 


gca 
Ar a 
435 


agt 
Ser 


tea 

Ser 


gca 
AX a 


att 

lie 


gaa 
Glu 
440 


caa 
Gin 


att 
lie 


aaa 
Lys 


aat 
Asn 


gac 
Asp 
445 


etc 
Leu 


aaa 
Lys 


gaa 
Glu 


1344 


50 


cag 
Gin 


att 
He 
450 


gcg 
Ala 


ate 
He 


gca 
Ala 


eta 
Leu 


gga 
Gly 
455 


gca 
Ala 


acc 
Thr 


aat 
Asn 


aat 
Asn 


get 
Ala 
460 


tct 
Ser 


aaa 
Lys 


caa 
Gin 


tct 
Ser 


1392 


55 


ctg 
Leu 
465 


att 
He 


gec 
Ala 


aag 
Lys 


gaa 
Glu 


aga 
Arg 
470 


att 
He 


ata 

He 


caa 
Gin 


get 
Ala 


ttt 
Phe 
475 


get 
Ala 


cat 
His 


aaa 
Lys 


agt 

Ser 


tac 
Tyr 
480 


1440 


60 


tgg 
Trp 


caa 
Gin 


gga 
Gly 


eta 
Leu 


aac 
Asn 
485 


get 
Ala 


caa 
Gin 


gat 
Asp 


aaa 
Lys 


cga 
Arg 
490 


gca 
Ala 


ate 
He 


etc 
Leu 


aat 
Asn 


ggt 
Gly 
495 


tgc 
Cys 


1488 




gta 
Val 


aaa 
Lys 


aaa 
Lys 


ate 
He 
500 


tec 
Ser 


gta 
Val 


gat 
Asp 


ggt aac 
Gly Asn 

505 


ttt 
Phe 


gtt 

Val 


aca 
Thr 


get 
Ala 


att 
He 
510 


gag 
Glu 


tat 
Tyr 


1536 


65 


cgt 
Arg 


tac 
Tyr 


tag 
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<210> 59 
<211> 514 
<212> PRT 

<213> XisF recombinase 



10 



15 



:0 



!5 



iO 



55 



40 



45 



50 



55 



60 



65 



<400> 59 

Met Glu Asn Trp Gly Tyr Ala 
1 5 

Asp Lys Gly Ala Leu Arg Lys 
20 

Cys Ser Lys Val Tyr Trp Asp 
35 

Glu Gly Leu Gin Gin Leu lie 
50 55 

Lys Val Lys Ser Leu Gin Phe 
65 70 

Ser Arg Leu Phe Tyr Ser Leu 
85 

Lys Leu lie Ala Leu Asp Gin 
100 

Glu Leu Thr lie Asp Met Leu 
115 

Met Val Thr Glu Arg Leu Lys 
130 135 

Gly Lys Ser His Arg Val Ala 
145 150 

Lys Tyr lie Arg Asp Arg Ser 
165 

Arg Glu Leu Thr Val Ser Asp 

180 • •"■ • 

Phe Glu Cys Gly Ser Val Ala 
195 

Phe Gly lie Glu Thr Lys Val 
210 215 



Arg Val Ser Gly Glu Glu Gin Gin Thr 
1G 15 

Gin lie Glu Arg Leu Arg Asn Ala Gly 
25 . 30 

lie Gin Ser Arg Thr Thr. Glu Val Arg 
40 45 

Asn Asp Leu Lys Thr Ser Ser Thr Gly 
60 

Thr Arg He Asp Arg He Gly Ser Ser 
75 ~ 80 

Leu Glu Val Leu Arg Ser Lys Gly He 
90 * 95 

Gly Val Asp Pro Asp Ser Leu Gly Gly 
105 no 

Leu Ala Ala Ala Lys Phe Glu Val Arq 
120 125 

Ser Glu Arg Arg His Arg Val Asn Gin 
140 

Pro Leu Gly Tyr Arg Lys Asp Lys Asp 
155 160 

Pro Cys Val Cys Leu Leu Glu Gly Arg 
170 175 

Leu Ala Gin Tyr He Phe His Thr Phe 
185 190 

Ala Thr Val Arg Lys Leu His Ser Asp 
200 205 

Leu Asn Trp Asn Lys Leu Glu Lys Ser 
220 



Ser Arg He Val Gly Asp Asp 
225 230 

Asn Lys Thr Asn His Pro Leu 
245 

Ser He Pro Gly Leu Lys Ala 
2 60 

Gly Leu Pro Phe Asp Thr Tyr 

275 

Phe Asp Glu Trp Lys Val Lys 
290 " 295 

He Thr Cys Glu Glu His Glu 
305 310 



Asp Leu Asp Lys He Ala Phe Thr Pro 
235 240 

Arg Tyr Pro Trp Ser Gly Leu Arg Trp 
250 255 

Leu Leu Val Asn Pro Val Tyr Ala Gly 
265 270 

Val Lys Ser Lys Gly Lys Arg Lys His 
280 285 

Trp Gly Thr His Asp Asp Glu Ala He 
300 

Arg He Lys Gin Met He Arg Asp Asn 
315 320 



82 



Arg Asn Asn Arg Trp Ala Ala Arg Glu Glu Asn Glu Val Asn Pro Phe 
325 330 335 

Ser Asn Leu Leu Lys Cys Thr His Cys Gly Gly Ser Met Thr Arg His 
340 345 350 

Ala Lys Arg Val Asp Lys Ser Gly Gin Ala lie Tyr Tyr Tyr Gin Cys 
355 360 365 

Arg Leu Tyr Lys Ala Gly Asn Cys Ser Asn Lys Asn Met lie Ser Ser 
370 375 380 

Lys lie Leu Asp He Gin Val Met Asp Leu Leu Ala Gin Glu Ala Glu 
385 390 395 400 

Arg Leu Ala Asn Leu Val Glu Thr Asp Glu Pro Leu He Val Glu Glu 
405 410 415 

Pro Pro Glu Val Lys Thr Leu Arg Ala Ser Leu Asn Ser Leu Glu Thr 
420 425 430 

Leu Pro Ala Ser Ser Ala He Glu Gin He Lys Asn Asp Leu Lys Glu 
435 440 445 

Gin He Ala He Ala Leu Gly Ala Thr Asn Asn Ala Ser Lys Gin Ser 
4 50 4 55 4 60 

Leu He Ala Lys Glu Arg He He Gin Ala Phe Ala His Lys Ser Tyr 
465 470 475 ^ 480 

Trp Gin Gly Leu Asn Ala Gin Asp Lys Arg Ala He Leu Asn Gly Cys 
485 490 495 

Val Lys Lys He Ser Val Asp Gly Asn Phe Val Thr Ala lie Glu Tyr 
500 505 510 

Arg Tyr 



<210> 60 

<211> 2124 

<212> DNA 

<213> Transposon Tn4451 

<220> 

<221> CDS 

<222> (1) . . (2121) 

<400> 60 

atg tea agg act tea aga att aca gca ctt tac gag cgt ttg tea aga 

Met Ser Arg Thr Ser Arg lie Thr Ala Leu Tyr Glu Arg Leu Ser Arg 
1 5 10 15 

gat gat gac ctt act ggc gag agt aat tct att acc aat caa aag aaa 
Asp Asp Asp Leu Thr Gly Glu Ser Asn Ser lie Thr Asn Gin Lys Lys 
20 25 30 

tac etc gaa gat tat gec cgt agg aat ggt ttt gag aac att cgc cat 
Tyr Leu Glu Asp Tyr Ala Arg Arg Asn Gly Phe Glu Asn lie Arg His 
35 40 45 

ttt acc gat gac gga ttt teg ggt gta aat ttc aat cgc cct ggc ttt 
Phe Thr Asp Asp Gly Phe Ser Gly Val Asn Phe Asn Arg Pro Gly Phe 
50 55 60 



83 



caa tct ctg ata aaa gaa gtt gaa gca gga aat gta gaa acc ttg att 
Gin Ser Leu lie Lys Glu Val Glu Ala Gly Asn Val Glu Thr Leu He 

65 ~ 70 15 80 

gtt aag gat atg age cga ttg ggg cga aat tat ctg caa gta ggt ttt 
Val Lys Asp Met Ser Arg Leu Gly Arg Asn Tyr Leu Gin Val Gly Phe 
8 5 90 95 

tat acg gaa gtt ctg ttt cca cag aaa aat gtc cgt ttc ctt gca att 
Tyr Thr Glu Val Leu Phe Pro Gin Lys Asn Val Arg Phe Leu Ala He 
100 105 110 

aac aac agt att gac agt aac aac get teg gat aat gac ttt get ccg 
Asn Asn Ser He Asp Ser Asn Asn Ala Ser Asp Asn Asp Phe Ala Pro 
115 120 ~ 125 

ttt ttg aat att atg aac gaa tgg tat gee aaa gac aca age aac aaa 
Phe Leu Asn He Met Asn Glu Trp Tyr Ala Lys Asp Thr Ser Asn Lys 
130 135 140 

ate aag get ata ttc gat gee cgt atg aaa gac gga aag cgt tgt age 
He Lys Ala He Phe Asp Ala Arg Met Lys Asp Gly Lys Arg Cys Ser 
145 150 155 160 

ggt tct ate cct tat ggg tat aac cga ctg ccg age gac aaa caa acg 
Gly Ser He Pro Tyr Gly Tyr Asn Arg Leu Pro Ser Asp Lys Gin Thr 
165 170 * 175 

ctt gtg gtt gac cct gtg get teg gaa gtg gta aag cgt ate ttt act 
Leu Val Val Asp Pro Val Ala Ser Glu Val Val Lys Arg He Phe Thr 
180 185 190 

ctt gee aat gat ggc aaa agt aca agg gca ate gca gaa ata ctg acc 
Leu Ala Asn Asp Gly Lys Ser Thr Arg Ala He Ala Glu He Leu Thr 
195 * 200 205 

gaa gaa aaa gtt tta acc cct gcg gca tac gca aag gaa tac cac ccc 
Glu Glu Lys Val Leu Thr Pro Ala Ala Tyr Ala Lys Glu Tyr His Pro 
210 215 ' 220 

gaa cag tac aac ggc aac aag ttc aca aac cct tat ctt tgg gca atg 
Glu Gin Tyr Asn Gly Asn Lys Phe Thr Asn Pro Tyr Leu Trp Ala Met 
225 230 235 240 

tea acg ata aga aat att tta ggc agg cag gaa tat etc ggt cac acc 
Ser Thr He Arg Asn He Leu Gly Arg Gin . Glu Tyr Leu Gly His Thr 
245 250 255 

gtt ttg cga aag teg gta age aca aat ttc aaa ctt cac aag aga aaa 
Val Leu Arg Lys Ser Val Ser Thr Asn Phe Lys Leu His Lys Arg Lys 
260 265 270 

age aca gac gaa gaa gaa cag tat gta ttt ccg aat aca cac gag cct 
Ser Thr Asp Glu Glu Glu Gin Tyr Val Phe Pro Asn Thr His Glu Pro 
275 280 285 

ate ata teg cag gaa ctt tgg gac age gtt caa aaa cgc aga age aga 
He He Ser Gin Glu Leu Trp Asp Ser Val Gin Lys Arg Arg Ser Arg 
290 295 300 

gta aat cgt gee teg get tgg gga acg cac age aac cgt tta age gga 
Val Asn Arg Ala Ser Ala Trp Gly Thr His Ser Asn Arg Leu Ser Gly 
305 310 315 320 



tat ttg tac tgt gee gat tgc gga aga aga atg act ttg cag aca cat 
Tyr Leu Tyr Cys Ala Asp Cys Gly Arg Arg Met Thr Leu Gin Thr His 

325 330 335 



1! 'nwr'ii '% 



tac age aaa aaa gac ggt tct gtg cag tat tct tac cgt tgc ggt ggg 1056 
Tyr .Ser Lys Lys Asp Gly Ser Val Gin Tyr Ser Tyr Arg Cys Gly Gly 
340 345 350 

tat gca age aga gtg aac agt tgt acc agt cat teg att agt acc gat 1104 
Tyr Ala Ser Arg Val Asn Ser Cys Thr Ser His Ser lie Ser Thr Asp 
355 360 365 

10 aat gtt gaa gee ttg ata tta tea tct gtc aaa cgc ttt tea agg ttt 1152 
Asn Val Glu Ala Leu lie Leu Ser Ser Val Lys Arg Phe Ser Arg Phe 
370 375 380. 

gtt ctg aat gat gaa caa gca ttt get ttg gaa ctg caa tct ctt tgg 1200 
15 Val Leu Asn Asp Glu Gin Ala Phe Ala Leu Glu Leu Gin Ser Leu Trp 
385 390 395 400 

aat gaa aaa cag gag gaa aag.ccg aaa cac aat caa teg gaa ctg caa 1248 
Asn Glu Lys Gin Glu Glu Lys Pro Lys His Asn Gin Ser Glu Leu Gin 
i0 405 410 415 

; cgc tgt cag aaa cgc tat gac gaa etc tct acc ctt gtt cgt ggc ttg 1296 

Arg Cys Gin Lys Arg Tyr Asp Glu Leu Ser Thr Leu Val Arg Gly Leu 
420 425 430 



15 

! 



35 



45 



65 



tat gaa aat ctt atg teg gga tta ctg ccc gaa aga cag tat aag caa 1344 
Tyr Glu Asn Leu Met Ser Gly Leu Leu Pro Glu Arg Gin Tyr Lys Gin 
435 440 445 



30 ctg atg aaa cag tat gat gac gag cag gea gag ttg gaa acg aaa atg 1392 
Leu Met Lys Gin Tyr Asp Asp Glu Gin Ala Glu Leu Glu Thr Lys Met 
4 50 4 55 4 60 



gaa acg atg aaa aca gaa ctt gec gaa gaa aaa gta agt tec gtt gat 1440 
Glu Thr Met Lys Thr Glu Leu Ala Glu Glu Lys Val Ser Ser Val Asp 
465 470 475 480 



att aag cat ttc att teg ctg ata cgc aag tgt aaa aat cct acg gaa 1488 
^ lie Lys His Phe lie Ser Leu lie Arg Lys Cys Lys Asn Pro Thr Glu 

40 485 490 ' 495 



ate tec gat aca atg ttt aat gaa ctt gtt gat aag ata gtg gtt tat 1536 
lie Ser Asp Thr Met Phe Asn Glu Leu Val Asp Lys lie Val Val Tyr 
500 505 510 

gaa gca gag ggt gtg gga aaa gca cga aca caa aag gtc gat att tat 1584 
Glu Ala Glu Gly Val Gly Lys Ala Arg Thr Gin Lys Val Asp lie Tyr 
515 520 525 



50 ttt aac tat gtc ggt caa gtg gat att gec tat acc gaa gaa gaa ctt 
Phe Asn Tyr Val Gly Gin Val Asp He Ala Tyr Thr Glu Glu Glu Leu 
530 535 540 



1632 



gec gag ata gaa aca cag aaa gag cag gag gaa cag caa cgc ttg gca 1680 
55 Ala Glu He Glu Thr Gin Lys Glu Gin Glu Glu Gin Gin Arg Leu Ala 
54 5 550 555 560 

aga cag cgc aag cgt gaa aaa gec tac cga gaa aag cga aag gca cag 1728 
Arg Gin Arg Lys Arg Glu Lys Ala Tyr Arg Glu Lys Arg Lys Ala Gin 
oO 565 570 575 



aaa ate get gaa aac ggt ggc gaa ate gtt aag aca aag gtt tgc cct 1776 
Lys He Ala Glu Asn Gly Gly Glu He Val Lys Thr Lys Val Cys Pro 
580 585 590 

cat tgc aac aaa gag ttt ate ccg aca age aac cga cag gtg ttc tgt 1824 
His Cys Asn Lys Glu Phe He Pro Thr Ser Asn Arg Gin Val Phe Cys 



10 



85 

595 600 605 

tec aaa gag tgc tgc tat caa gca agg caa gac aaa aag aaa aca gac 1872 
Ser Lys Glu Cys Cys Tyr Gin Ala Arg Gin Asp Lys Lys Lys Thr Asp 
610 " ' 615 " 620 

cga gaa gca gaa cga gga aat cac tat tac cga cag cgt gta tgt get 1920 
Arg Glu Ala Glu Arg Gly Asn His Tyr Tyr Arg Gin Arg Val Cys Ala 
625 630 " 635 ' ' 640 

gtg tgc ggc aat tec tat tgg cct aca cac age caa cag aaa ttc tgc 1968 
Val Cys Gly Asn Ser Tyr Trp Pro Thr His Ser Gin .Gin Lys Phe Cys 
645 650 655 

15 tec gaa gaa tgt caa agg gta aat cac aat aag aaa aca ttg gaa ttt 2016 
Ser Glu Glu Cys Gin Arg Val Asn His Asn Lys Lys Thr Leu Glu Phe 
660 665 * 670 

tac cac cat aaa aaa gaa aag gag aag ctg caa tgc aaa gat tta tea 2064 
10 Tyr His His Lys Lys Glu Lys Glu Lys Leu Gin. Cys Lys Asp Leu Ser 
j " 675 ' ~ 680 " 685 

i cag acg aaa gaa egg gta tec gat atg aac tta teg ggg act att act 2112 

Gin Thr Lys Glu Arg Val Ser Asp Met Asn Leu Ser Gly Thr lie Thr 
15 690 695 - 700 

acc cct get taa 2124 
: Thr Pro Ala 
705 

30 

<210> 61 
<211> 707 
i <212> PRT 

35 <213> Transposon Tn4451 

I 

<400> 61 

Met Ser Arg Thr Ser Arg lie Thr Ala Leu Tyr Glu Arg Leu Ser Arg 
1 5 10 15 

Asp Asp Asp Leu Thr Gly Glu Ser Asn Ser lie Thr Asn Gin Lys Lys 
20 25 30 



40 



Tyr Leu Glu Asp Tyr Ala Arg Arg Asn Gly Phe Glu Asn lie Arg His 
45 35 40 " 45 

Phe Thr Asp Asp Gly Phe Ser Gly Val Asn Phe Asn Arg Pro Gly Phe 
50 55 60 

50 Gin Ser Leu He Lys Glu Val Glu Ala Gly Asn Val Glu Thr Leu He 
65 70 75 80 



55 



Val Lys Asp Met Ser Arg Leu Gly Arg Asn Tyr Leu Gin Val Gly Phe 
85 90 95 

Tyr Thr Glu Val Leu Phe Pro Gin Lys Asn Val Arg Phe Leu Ala He 
100 105 110 



Asn Asn Ser lie Asp Ser Asn Asn Ala Ser Asp Asn Asp Phe Ala Pro 

60 115 120 125 

Phe Leu Asn He Met Asn Glu Trp Tyr Ala Lys Asp Thr Ser Asn Lys 

130 135 " 140 

65 He Lys Ala He Phe Asp Ala Arg Met Lys Asp Gly Lys Arg Cys Ser 

145 150 ' 155 ' 160 



Gly Ser lie Pro Tyr Gly Tyr Asn Arg Leu Pro Ser Asp Lys Gin Thr 
165 170 175 

Leu Val Val Asp Pro Val Ala Ser Glu Val Val Lys Arg lie Phe Thr 

5 180 185 190 

Leu Ala Asn Asp Gly Lys Ser Thr Arg Ala lie Ala Glu lie Leu Thr 
/ 195 200 205 

10 Glu Glu Lys Val Leu Thr Pro Ala Ala Tyr Ala Lys Glu Tyr His Pro 
210 215 220 



15 



45 



60 



Glu Gin Tyr Asn Gly Asn Lys Phe Thr Asn Pro Tyr Leu Trp Ala Met 

225 230 235 240 

Ser Thr lie Arg Asn lie Leu Gly Arg Gin Glu Tyr Leu Gly His Thr 

245 250 255 



Val Leu Arg Lys Ser Val Ser Thr Asn Phe Lys Leu His Lys Arg Lys 

,-- ' 20 260 265 270 

I ! Ser Thr Asp Glu Glu Glu Gin Tyr Val Phe Pro Asn Thr His Glu Pro 

j 275 280 285 

25 lie He Ser Gin Glu Leu Trp Asp Ser Val Gin Lys Arg Arg Ser Arg 
290 295 300 

Val Asn Arg Ala Ser Ala Trp Gly Thr His Ser Asn Arg Leu Ser Gly 
| 305 310 315 320 

30 

i Tyr Leu Tyr Cys Ala Asp Cys Gly Arg Arg Met Thr Leu Gin Thr His 
i 325 330 335 

f Tyr Ser Lys Lys Asp Gly Ser Val Gin Tyr Ser Tyr Arg Cys Gly Gly 
! 35 340 345 350 

Tyr Ala Ser Arg Val Asn Ser Cys Thr Ser His Ser He Ser Thr Asp 
355 360 365 

40 Asn Val Glu Ala Leu He Leu Ser Ser Val Lys Arg Phe Ser Arg Phe 
370 375 380 



Val Leu Asn Asp Glu Gin Ala Phe Ala Leu Glu Leu Gin Ser Leu Trp 
385 390 395 400 

Asn Glu Lys Gin Glu Glu Lys Pro Lys His Asn Gin Ser Glu Leu Gin 
405 410 415 



Arg Cys Gin Lys Arg Tyr Asp Glu Leu Ser Thr Leu Val Arg Gly Leu 
50 420 425 430 

Tyr Glu Asn Leu Met Ser Gly Leu Leu Pro Glu Arg Gin Tyr Lys Gin 
435 440 445 

55 Leu Met Lys Gin Tyr Asp Asp Glu Gin Ala Glu Leu Glu Thr Lys Met 
450 455 460 



Glu Thr Met Lys Thr Glu Leu Ala Glu Glu Lys Val Ser Ser Val Asp 
465 470 475 480 

He Lys His Phe He Ser Leu He Arg Lys Cys Lys Asn Pro Thr Glu 
485 490 495 



He Ser Asp Thr Met Phe Asn Glu Leu Val Asp Lys He Val Val Tyr 
65 500 505 510 

Glu Ala Glu Gly Val Gly Lys Ala Arg Thr Gin Lys Val Asp He Tyr 



515 



520 



525 



Phe Asn Tyr Val Gly Gin Val Asp lie Ala Tyr Thr Glu Glu Glu Leu 

530 535 540 

5 

Ala Glu He Glu Thr Gin Lys Glu Gin Glu Glu Gin Gin Arg Leu Ala 
545 550 555 560 

Arg Gin Arg Lys Arg Glu Lys Ala Tyr Arg Glu Lys Arg Lys Ala Gin 
10 565 570 575 



15 



20 



Lys He Ala Glu Asn Gly Gly Glu He Val Lys Thr. Lys Val Cys Pro 
580 585 590 

His Cys Asn Lys Glu Phe He Pro Thr Ser Asn Arg Gin Val Phe Cys 
595 600 605 

Ser Lys Glu Cys Cys Tyr Gin Ala Arg Gin Asp Lys Lys Lys Thr Asp 
610 615 620 

Arg Glu Ala Glu Arg Gly Asn His Tyr Tyr Arg iGln Arg Val Cys Ala 
625 630 635 64 0 



! Val Cys Gly Asn Ser Tyr Trp Pro Thr His Ser Gin Gin Lys Phe Cys 
25 645 650 655 



30 



Ser Glu Glu Cys Gin Arg Val Asn His Asn Lys Lys Thr Leu Glu Phe 
660 665 670 

Tyr His His Lys Lys Glu Lys Glu Lys Leu Gin Cys Lys Asp Leu Ser 

675 * 680 685 



35 



Gin Thr Lys Glu Arg Val Ser Asp Met Asn Leu Ser Gly Thr He Th: 
690 695- 700 

Thr Pro Ala 
705 



45 



40 

<210> 62 
<211> 1420 
. <212> DNA 

<213> XisA recombinase Z 

<220> 

<221> CDS 

<222> (1) . . (1416) 

50 <400> 62 

atg caa.aat cag ggt caa gac aaa tat caa caa gcc ttt gca gac tta 48 

Met Gin Asn Gin Gly Gin Asp Lys Tyr Gin Gin Ala Phe Ala Asp Leu 

15 10 15 

55 gag cca ctt tea tct acc gac ggc agt ttt etc ggc tea agt ctg caa 96 
Glu Pro Leu Ser Ser Thr Asp Gly Ser Phe Leu Gly Ser Ser Leu Gin 
20 25 30 

gca cag cag caa aga gaa cac atg aga aca aaa gta eta caa gac eta 14 4 
60 Ala Gin Gin Gin Arg Glu His Met Arg Thr Lys Val Leu Gin Asp Leu 
35 40 45 

gac aag gta aat ctg cgt ttg aag tct gca aag acg aaa gtc tea gtt 192 
Asp Lys Val Asn Leu Arg Leu Lys Ser Ala Lys Thr Lys Val Ser Val 
65. 50 55 60 

cga gaa tct aac gga agt ctg caa tta cga gca acg tta cca att aaa 24 0 



Arg Glu Ser Asn Gly Ser Leu Gin Leu Arg Ala Thr Leu Pro He Lys 



65 70 75 



80 



cct gga gat aag gac acc aac ggt aca ggc aga aag caa tac aat etc 
Pro Gly Asp Lys Asp Thr Asn Gly Thr Gly Arg Lys Gin Tyr Asn Leu 
85 90 95 



320 



aga caa gca tta ccc tta cat aaa gaa tgg att gat gag ttt gat tta 
Arg Gin Ala Leu Pro Leu His Lys Glu Trp He Asp Glu Phe Asp Leu 
325 330 335 



288 



age ttg aat ate cct gca aac ttg gat gga ctg aag acg get gag gaa 336 
Ser Leu Asn He Pro Ala Asn Leu Asp Gly Leu Lys Thr Ala Glu Glu 
100 105 110 

gaa get tat gaa tta ggt aaa tta ate get egg aaa . acc ttt gaa tgg 384 
Glu Ala Tyr Glu Leu Gly Lys Leu lie Ala Arg Lys Thr Phe Glu Trp 
115 120 125 

aat gat aaa tat tta ggc aaa gaa gec act aaa aaa gat tea caa aca 4 32 
Asn Asp Lys Tyr Leu Gly Lys Glu Ala Thr Lys Lys Asp Ser Gin Thr 
130 135 140 

ata ggt gat tta eta gaa aaa ttt gca gaa gag tat ttt aaa acc cat 4 80 
He Gly Asp Leu Leu Glu Lys Phe Ala Glu Glu Tyr Phe Lys Thr His 
145 150 155 



528 



160 

aaa cgc acc act aaa age gaa cat acc ttt ttt tac tat ttt tec cgc 
Lys Arg Thr Thr Lys Ser Glu His Thr Phe Phe Tyr Tyr Phe Ser Arg 
165 170 175 

acc caa cga tat acc aat tec aaa gat tta gca acg gcg gaa aat etc 57 6 
Thr Gin Arg Tyr Thr Asn Ser Lys Asp Leu Ala Thr Ala Glu Asn Leu 
180 185 190 

ate aat tea att gag caa ate gat aaa gaa tgg gcg aga tat aat gec 624 
He Asn Ser He Glu Gin He Asp Lys Glu Trp Ala Arg Tyr Asn Ala 
195 200 205 

gec aga gee ata tea get ttt tgc ata aca ttc aat ata gaa att gat 
Ala Arg Ala He Ser Ala Phe Cys He Thr Phe Asn He Glu He Asp 
210 215 220 

ttg tec cag tat tec aaa atg cct gat cgc aat teg cgc aac ate ccc 
Leu Ser Gin Tyr Ser Lys Met Pro Asp Arg Asn Ser Arg Asn lie Pro 
225 230 . 235 24 0 

aca gat gca gaa ata eta tea gga att acc aaa ttt gaa gac tat eta 
Thr Asp Ala Glu He Leu Ser Gly He Thr Lys Phe Glu Asp Tyr Leu 
245 250 255 

gtt acc aga gga aat caa gtt aat gaa gat gta aaa gat age tgg caa 
Val Thr Arg Gly Asn Gin Val Asn Glu Asp Val Lys Asp Ser Trp Gin 
260 - 265. 270 

ctt tgg cgc tgg aca tat gga atg tta gca gtt ttt ggt tta cgc ccc 
Leu Trp Arg Trp Thr Tyr Gly Met Leu Ala Val Phe Gly Leu Arg Pro 
275 280 285 

agg gaa att ttt att aac cct aat att gat tgg tgg tta age aaa gag 
Arg Glu He Phe He Asn Pro Asn He Asp Trp Trp Leu Ser Lys Glu 
290 295 300 

aat ata gac etc aca tgg aaa gta gac aaa gaa tgt aaa act ggt gaa 
Asn He Asp Leu Thr Trp Lys Val Asp Lys Glu Cys Lys Thr Glv Glu 
305 310 315 



672 



720 



768 



816 



864 



912 



960 



1008 



10 



20 



25 



45 



55 



65 



89 

aga aat ccg aaa tat tta gaa atg ctg gca aca gca att agt aaa aaa 1-056 

Arg Asn Pro Lys Tyr Leu Glu Met Leu Ala Thr Ala He Ser Lys Lys 

34 0 345 350 

gat aaa aca aat cat get gaa ata aca gec tta act cag cgt att agt 1104 

Asp Lys Thr Asn His Ala Glu He Thr Ala Leu Thr Gin Arg He Ser 

355 360 365 

tgg tgg ttt egg aaa gtc gaa tta gat ttt aaa ccc tat gat tta cgt 1152 

Trp Trp Phe Arg Lys Val Glu Leu Asp Phe Lys Pro Tyr Asp Leu Arg 

370 375 380 



cac gec tgg gca ate aga gcg cat att tta ggc.ata cca ate aaa gcg - 1200 
His Ala Trp Ala He Arg Ala His lie Leu Gly He Pro He Lys Ala 
15 385 390 395 400 



gcg get gat aat ttg ggg cat agt atg cag gtt cat aca caa acc tat 1248 
Ala Ala Asp Asn Leu Gly His Ser Met Gin Val His Thr Gin Thr Tyr 
405 410 415 

cag cgc tgg ttc teg eta gat atg egg aag tta gcg att aat cag get 1296 
Gin Arg Trp Phe Ser Leu Asp Met Arg Lys Leu Ala He Asn Gin Ala 
420 425 430 

ttg act aag agg aat gaa ttt gag gtg att agg gag gag aat get aaa 1344 
Leu Thr Lys Arg Asn Glu Phe Glu Val He Arg Glu Glu Asn Ala Lys 
435 440 445 



ttg ca 9 ata gaa aat gaa agg ttg agg atg gaa att gag aag tta aag 1392 
30 Leu Gin He Glu Asn Glu Arg Leu Arg Met Glu He Glu Lys Leu Lys 
450 455 4 60 



atg gaa ata get tat aag aat agt tgag 1420 
| Met Glu lie Ala Tyr Lys Asn Ser 

' 35 4 65 470 

i 

i <210> 63 

<211> 472 
40 <212> PRT 

<213> XisA recombinase 



<400> 63 

Met Gin Asn Gin Gly Gin Asp Lys Tyr Gin Gin Ala Phe Ala Asp Leu 
1 5 10 15 

Glu Pro Leu Ser Ser Thr Asp Gly Ser Phe Leu Gly Ser Ser Leu Gin 
20 25 30 

50 Ala Gin Gin Gin Arg Glu His Met Arg Thr Lys Val Leu Gin Asp Leu 
35 40 45 



Asp Lys Val Asn Leu Arg Leu Lys Ser Ala Lys Thr Lys Val Ser Val 
50 55 60 

Arg Glu Ser Asn Gly Ser Leu Gin Leu Arg Ala Thr Leu Pro He Lys 
65 70 75 80 



Pro Gly Asp Lys Asp Thr Asn Gly Thr Gly Arg Lys Gin Tyr Asn Leu 
60 85 90 95 



Ser Leu Asn He Pro Ala Asn Leu Asp Gly Leu Lys Thr Ala Glu Glu 
100 105 " 110 

Glu Ala Tyr Glu Leu Gly Lys Leu He Ala Arg Lys Thr Phe Glu Trp 
115 120 125 
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Asn Asp Lys Tyr Leu Gly Lys Glu Ala Thr Lys Lys Asp Ser Gin Thr 
130 135 -L4Q 

He Gly Asp Leu Leu Glu Lys Phe Ala Glu Glu Tyr Phe Lys Thr His 
145 150 155 160 

Lys Arg Thr Thr Lys Ser Glu His Thr Phe Phe Tyr Tyr Phe Ser Arg 
165 170 175 

Thr Gin Arg Tyr Thr Asn Ser Lys Asp Leu Ala Thr Ala Glu Asn Leu 
180 185 190 

He Asn Ser He Glu Gin He Asp Lys Glu Trp Ala 'Arg Tyr Asn Ala 
195 200 205 

Ala Arg Ala He Ser Ala Phe Cys He Thr Phe Asn He Glu He Asp 
210 215 220 



20 



25 



50 



35 



40 



45 



Leu Ser Gin Tyr Ser Lys Met Pro Asp Arg Asn Ser Arg Asn He Pro 

225 230 235 240 

Thr Asp Ala Glu He Leu Ser Gly He Thr Lys Phe Glu Asp Tyr Leu 
245 250 255 

Val Thr Arg Gly Asn Gin Val Asn Glu Asp Val Lys Asp Ser Trp Gin 
260 265 ' 270 

Leu Trp Arg Trp Thr Tyr Gly Met Leu Ala Val Phe Gly Leu Arg Pro 

275 280 285 

Arg Glu lie Phe He Asn Pro Asn He Asp Trp Trp Leu Ser Lys Glu 
290 295 300 

Asn lie Asp Leu Thr Trp Lys Val Asp Lys Glu Cys Lys Thr Gly Glu 
305 310 315 320 

Arg Gin Ala Leu Pro Leu His Lys Glu Trp He Asp Glu Phe Asp Leu 
325 330 335 

Arg Asn Pro Lys Tyr Leu Glu Met Leu Ala Thr Ala He Ser Lys Lys 
340 345 350 

Asp Lys Thr Asn His Ala Glu He Thr Ala Leu Thr Gin Arg He Ser 

355 ' 360 365 

Trp Trp Phe Arg Lys Val Glu Leu Asp Phe Lys Pro Tyr Asp Leu Arg 
370 375 380 



His Ala Trp Ala He Arg Ala His He Leu Gly He Pro He Lys Ala 
385 390 395 400 

Ala Ala Asp Asn Leu Gly His Ser Met Gin Val His Thr Gin Thr Tyr 
405 410 415 

Gin Arg Trp Phe Ser Leu Asp Met Arg Lys Leu Ala He Asn Gin Ala 
420 425 430 

Leu Thr Lys Arg Asn Glu Phe Glu Val He Arg Glu Glu Asn Ala Lys 
4 35 440 445 

Leu Gin He Glu Asn Glu Arg Leu Arg Met Glu He Glu Lys Leu Lys 
450 455 460 

Met Glu He Ala Tyr Lys Asn Ser 
65. 465 470 



50 



55 



60 



10 



<210> 64 
<211> 1008 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1005) 



<220> 

<223> Description of Artificial Sequence: vector 
pBS-SSV3 

15 <400> 64 

atg acg aaa gat aag acg cgt tat aaa tac ggg gat tat att tta cgc 4 8 
Met Thr Lys Asp Lys Thr Arg Tyr Lys Tyr Gly Asp Tyr He Leu Arg 
,1 5 10 15 

20 gag agg aaa ggg egg tat tat gtt tac aag eta gag tat gaa aac ggt 96 
Glu Arg Lys Gly Arg Tyr Tyr Val Tyr Lys Leu Glu Tyr Glu Asn Gly 
20 25 30 

gag gta aaa gag cgt tac gtg ggt cct tta get gac gtc gtt gaa tea 144 
25 Glu Val Lys Glu Arg Tyr Val Gly Pro Leu Ala Asp Val Val Glu Ser 
35 40 45 

tat eta aaa atg aaa tta ggg gtc gta ggg gat act ccc eta caa gcg 192 
Tyr Leu Lys Met Lys Leu Gly Val Val Gly Asp Thr Pro Leu Gin Ala 
50 50 55 60 

gat ccc ccc ggt ttc gag ccc ggg aca age gga age ggt ggt gga aaa 240 
. Asp Pro Pro Gly Phe Glu Pro Gly Thr Ser Gly Ser Gly Gly Gly Lys 
65 '70 75 80 • 

55 

gag gga act gaa cga cgt aaa ata gcg ttg gtt gec aat ttg cgc caa 288 
Glu Gly Thr Glu Arg Arg Lys He Ala Leu Val Ala Asn Leu Arg Gin 
85 90 95 

L- 

40 tac gcg acg gac ggc aac ata aag gcg ttc tac aac tat etc atg aac 336 
Tyr Ala Thr Asp Gly Asn He Lys Ala Phe Tyr Asn Tyr Leu Met Asn 
100 105 110 ... '■ 

gaa agg ggg ata age gaa aaa act gca aag gac tac ate aat get ata 384 
45 Glu Arg Gly He Ser Glu Lys Thr Ala Lys Asp Tyr He Asn Ala He 
115 120 125 

tea aag ccg tat aaa gag acg aga gac gca cag aag get tac cga etc 4 32 
Ser Lys Pro Tyr Lys Glu Thr Arg Asp Ala Gin Lys Ala Tyr Arg Leu 
50 130 135 140 

ttt gca cgt ttc tta gcg tea cgc aat ate ata cat gat gaa ttt gcg 480 

Phe Ala Arg Phe Leu Ala Ser Arg Asn He He His Asp Glu Phe Ala 

145 150 155 160 

55 

gat aaa ata ttg aaa gcg gta aag gtg aag aag gcg aac get gat ate 528 

Asp Lys He Leu Lys Ala Val Lys Val Lys Lys Ala Asn Ala Asp He 
165 170 " 175 

60 tac att cca acg ttg gaa gag ata aaa agg acg tta caa tta gca aaa 576 
Tyr lie Pro Thr Leu Glu Glu lie Lys Arg Thr Leu Gin Leu Ala Lys 
180 185 190 

gac tat age gaa aac gtc tac ttc ate tac cgt ate get etc gag teg 624 
65 Asp Tyr Ser Glu Asn Val Tyr Phe He Tyr Arg He Ala Leu Glu Ser 
195 200 205 



ggc gtt agg ctg age gaa ata ctg aaa gtg ctg aag gaa ccc gaa agg 672 
Gly Val Arg Leu Ser Glu lie Leu Lys Val Leu Lys Glu Pro Glu Arg 

210 215 '" 220 

gac att tgc ggt aac gac gtc tgt tat tat ccg ctt agt tgg act agg 720 
Asp lie Cys Gly Asn Asp Val Cys Tyr Tyr Pro Leu Ser Trp Thr Arg 
225 230 ~ " 235 " 240 

gga tat aag ggc gtc ttc tat gta ttc cac ata acg cct ctg aag aga 768 
Gly Tyr Lys Gly Val Phe Tyr Val Phe His lie Thr Pro Leu Lys Arg 
245 250 255 

gta gag gtg acg aag tgg gca ata gcg gac ttt gaa cga cgt cat aag 816 
Val Glu Val Thr Lys Trp Ala lie Ala Asp Phe Glu Arg Arg His Lys 
260 265 270 

gac get ata gcg ata aag tac ttc cgc aaa ttc gta gcg tct aag atg 864 
Asp Ala lie Ala lie Lys Tyr Phe Arg Lys Phe Val Ala Ser Lys Met 
275 280 " * 285 

get gag eta age gta ccg tta gat att ate gat ttt att caa ggg cgt 912 
Ala Glu Leu Ser Val Pro Leu Asp lie lie Asp Phe lie Gin Gly Arg 
290 295 " 300 

aaa ccg aca cgc gtt tta acg caa cat tac gta teg etc ttc ggc ata 960 
Lys Pro Thr Arg Val Leu Thr Gin His Tyr Val Ser Leu Phe Gly lie 
305 " 310 ' 315 320 

gcg aaa gag caa tat aaa aag tat gcg gaa tgg eta aaa ggg gtc tga 100E 
Ala Lys Glu Gin Tyr Lys Lys Tyr Ala Glu Trp Leu Lys Gly Val 
325 " - 330 335 

<210> 65' - 
<211> 335 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: vector 
pBS-SSV3 

<400> 65 

Met Thr Lys Asp Lys Thr Arg Tyr Lys Tyr Gly Asp Tyr lie Leu Arg 
1 ' " 5 - 10 15 

Glu Arg Lys Gly Arg Tyr Tyr Val Tyr Lys Leu Glu Tyr Glu Asn Gly 
20 "25 30 

Glu Val Lys Glu Arg Tyr Val Gly Pro Leu Ala Asp Val Val Glu Ser 
35 40 45 

Tyr Leu Lys Met Lys Leu Gly Val Val Gly Asp Thr Pro Leu Gin Ala 
50 55 60 

Asp Pro Pro Gly Phe Glu Pro Gly Thr Ser Gly Ser Gly Gly Gly Lys 
65 70 75 80 

Glu Gly Thr Glu Arg Arg Lys lie Ala Leu Val Ala Asn Leu Arg Gin 
85 90 95 

Tyr Ala Thr Asp Gly Asn lie Lys Ala Phe Tyr Asn Tyr Leu Met Asn 
100 105 110 

Glu Arg Gly lie Ser Glu Lys Thr Ala Lys Asp Tyr He Asn Ala He 
115 120 - 125 



Ser Lys Pro Tyr Lys Glu Thr Arg Asp Ala Gin Lys Ala Tyr Arg Leu 
130 135 140 



93 



Phe Ala Arg Phe Leu Ala Ser Arg Asn lie He His Asp Glu Phe Ala 
145 150 155 160 

Asp Lys He Leu Lys Ala Val Lys Val Lys Lys Ala Asn Ala Asp He 
165 ' 170 ' 175 

Tyr lie Pro Thr Leu Glu Glu He Lys Arg Thr Leu Gin Leu Ala Lys 
180 185 : 190 

Asp Tyr Ser Glu Asn Val Tyr Phe He Tyr Arg He Ala Leu Glu Ser 
195 200 .205 

Gly Val Arg Leu Ser Glu He Leu Lys Val Leu Lys Glu Pro Glu Arg 
210 " 215 . 220 

Asp He Cys Gly Asn Asp Val Cys Tyr Tyr Pro Leu Ser Trp Thr Arg 
225 230 " 235 240 

Gly Tyr Lys Gly Val Phe Tyr Val Phe His He Thr Pro Leu Lys Arg 
245 250 255 

Val Glu Val Thr Lys Trp Ala lie Ala Asp Phe Glu Arg Arg His Lys 
260 265 270 

Asp Ala He Ala He Lys Tyr Phe Arg Lys Phe Val Ala Ser Lys Met 
275 ' 280 ' " 285 

Ala Glu Leu Ser Val Pro Leu Asp lie He Asp Phe He Gin Gly Arg 
290 295 " 300 

Lys Pro Thr Arg Val Leu Thr Gin His Tyr Val Ser Leu Phe Gly He 
305 310 315 320 

Ala Lys Glu Gin Tyr Lys Lys Tyr Ala Glu Trp Leu Lys Gly Val 
325 " ~ 330 ~ 335 



<210> 66 
<211> 1441 
<212> DNA 

:<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for fusion protein NLS-XisA 

<220> 

<221> CDS 

<222> (1) . . (1437) 

<400> 66 

atg ccc aag aag aag agg aag gtg caa aat cag ggt caa gac aaa tat 
Met Pro Lys Lys Lys Arg Lys Val Gin Asn Gin Gly Gin Asp Lys Tyr 
15 10 15 

caa caa gcc ttt gca gac tta gag cca ctt tea tct acc gac ggc agt 
Gin Gin Ala Phe Ala Asp Leu Glu Pro Leu Ser Ser Thr Asp Gly Ser 
20 25 30 

ttt etc ggc tea agt ctg caa gca cag cag caa aga gaa cac atg aga 
Phe Leu Gly Ser Ser Leu Gin Ala Gin Gin Gin Arg Glu His Met Arg 
35 40 45 



aca aaa gta eta caa gac eta gac aag gta aat ctg cgt ttg aag tct 
Thr Lys Val Leu Gin Asp Leu Asp Lys Val Asn Leu Arg Leu Lys Ser 



94 
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10 



15 



i 



30 



40 



50 



55 



60 



50 55 60 

gca aag acg aaa gtc tea gtt cga gaa tct aac gga agt ctg caa tta 240 
Ala Lys Thr Lys Val Ser Val Arg Glu Ser Asn Gly Ser Leu Gin Leu 



65 70 



75 80 



cga gca acg tta cca att aaa cct gga gat aag gac acc aac ggt aca 238 
Arg Ala Thr Leu Pro He Lys Pro Gly Asp Lys Asp Thr Asn Gly Thr 
85 90 95 

ggc aga aag caa tap aat etc age ttg aat ate cct gca aac ttg gat 336 
Gly Arg Lys Gin Tyr Asn Leu Ser Leu Asn He Pro .Ala Asn Leu Asp 
100 10 5 HO 

gga ctg aag acg get gag gaa gaa get tat gaa tta ggt aaa tta ate 384 
.Gly Leu Lys Thr Ala Glu Glu Glu Ala Tyr Glu Leu Gly Lys Leu He 
H5 120 125 

-get egg aaa acc ttt gaa tgg aat gat aaa tat tta ggc aaa gaa gec 432 
Ala Arg Lys Thr Phe Glu Trp Asn Asp Lys Tyr Leu Gly Lys Glu Ala 
130 135 140 

act aaa aaa gat tea caa aca ata ggt gat tta eta gaa aaa ttt gca 480 
Thr Lys Lys Asp Ser Gin Thr He Gly Asp Leu Leu Glu Lys Phe Ala 
:5 145 150 155 160 

, gaa gag tat ttt aaa acc cat aaa cgc acc act aaa age gaa cat acc 528 
Glu Glu Tyr Phe Lys Thr His Lys Arg Thr Thr Lys Ser Glu His Thr 
165 170 " 175 



ttt ttt tac tat ttt tec cgc acc caa cga tat acc aat tec aaa gat 
Phe Phe Tyr Tyr Phe Ser Arg Thr Gin Arg Tyr Thr Asn Ser Lys Asp 
I 180 185 190 

,35 tta gca acg gcg gaa aat etc ate aat tea att gag caa ate gat aaa 
; Leu Ala Thr Ala Glu Asn Leu He Asn Ser He Glu Gin He Asp Lvs 

; 195 200 205 



gaa tgg gcg aga tat aat gee gee aga gec ata tea get ttt tgc ata 
Glu Trp Ala Arg Tyr Asn Ala Ala Arg Ala He Ser Ala Phe Cys He 
210 215 220 



240 



cgc aat teg cgc aac ate ccc aca gat gca gaa ata eta tea gga att 
Arg Asn Ser Arg Asn He Pro Thr Asp Ala Glu He Leu Ser Gly He 
245 250 ■ 255 

acc aaa ttt gaa gac tat eta gtt acc aga gga aat caa gtt aat gaa 
Thr Lys Phe Glu Asp Tyr Leu Val Thr Arg Gly Asn Gin Val Asn Glu 
260 265 270 

gat gta aaa gat age tgg caa ctt tgg cgc tgg aca tat gga atg tta 
Asp Val Lys Asp Ser Trp Gin Leu Trp Arg Trp Thr Tyr Gly Met Leu 
275 280 285 

gca gtt ttt ggt tta cgc ccc agg gaa att ttt att aac cct aat att 
Ala Val Phe Gly Leu Arg Pro Arg Glu He Phe He Asn Pro Asn He 
290 295 300 



gat tgg tgg tta age aaa gag aat ata gac etc aca tgg aaa gta qac 
Asp Trp Trp Leu Ser Lys Glu Asn He Asp Leu Thr Trp Lys Val Asp 
«»■ 305 310 315 32 £ 

aaa gaa tgt aaa act ggt gaa aga caa gca tta ccc tta cat aaa gaa 



57 6 



624 



672 



aca ttc aat ata gaa att gat ttg tec cag tat tec aaa atg cct gat 720 
Thr Phe Asn He Glu He Asp Leu Ser Gin Tyr Ser Lys Met Pro Asp 
45 225 230 ". 235 



768 



816 



864 



312 



360 



1008 



95 



Lys Glu Cys Lys Thr Gly Glu Arg Gin Ala Leu Pro Leu His Lys Glu 

325 330 335 

tgg att gat gag ttt gat tta aga aat ccg aaa tat tta gaa atg ctg 1056 
Trp He Asp Glu Phe Asp Leu Arg Asn Pro Lys Tyr Leu Glu Me? Leu 



340 345 



350 



1104 



gca aca gca att agt aaa aaa gat aaa aca aat cat get gaa ata aca 
Ala Thr Ala lie Ser Lys Lys Asp Lys Thr Asn His Ala Glu lie Thr 
355 360 365 

IT r tta IT °f g Cgt att agt tgg tgg ttfc c 9<? aaa -9 tc g aa tta gat 1152 
Ala Leu Thr Gin Arg He Ser Trp Trp Phe Arg Lys Val Glu Leu Asp 
370 375 3 8 o V 

III f aa o cc ^ at gat tta cgt cac gcc tgg gca atc a ^ a g c g ca t att 1200 

Phe Lys Pro Tyr Asp Leu Arg His Ala- Trp Ala He Arg Ala His He 
385 39 0 395 400 

tta ggc ata cca atc aaa gcg'gcg get gat aat ttg ggg cat agt atg 1248 
Leu Gly He Pro He Lys Ala Ala Ala Asp Asn Leu Gly His Ser Met 
405 410 415 

rT IT a at C ? a aC ° t3t Cag Cgc tgg ttc tc 9 cta gat atg egg 1296 

Gin Val Has Thr Gin Thr Tyr Gin Arg Trp Phe Ser Leu Asp Met Arq 

420 425 430 

aag tta gcg att aat cag get ttg act aag agg aat gaa ttt gag gtq 1344 
Lys Leu Ala He Asn Gin Ala Leu Thr Lys Arg Asn Glu Phe Glu Val 
435 440 445 

att agg gag gag aat get aaa ttg cag ata gaa aat gaa agg ttg agg 1392 
He Arg Glu Glu Asn Ala Lys Leu Gin He Glu Asn Glu Arg Leu Arg 
4bu 455 460 

til g f a g f g f ag tta aa g afc g gaa ata get tat aag aat agt tgag 1441 

Met Glu He Glu Lys Leu Lys Met Glu lie Ala Tyr Lys Asn Ser 
46 5 470 475 

<210> 67 
<211> 479 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for fusion protein NLS-XisA 

<400> 67 

Met Pro Lys Lys Lys Arg Lys Val Gin Asn Gin Gly Gin Asp Lys Tyr 
5 10 15 

Gin Gin Ala Phe Ala Asp Leu Glu Pro Leu Ser Ser Thr Asp Gly Ser 

20 25 30 

Phe Leu Gly Ser Ser Leu Gin Ala Gin Gin Gin Arg Glu His Met Arg 
35 40 45 

Thr Lys Val Leu Gin Asp Leu Asp Lys Val Asn Leu Arg Leu Lys Ser 
bU 55 60 

Ala Lys Thr Lys Val Ser Val Arg Glu Ser Asn Gly Ser Leu Gin Leu 

70 75 80 

Arg Ala Thr Leu Pro He Lys Pro Gly Asp Lys Asp Thr Asn Gly Thr 
85 90 95 

Gly Arg Lys Gin Tyr Asn Leu Ser Leu Asn lie Pro Ala Asn Leu Asp 



96 



100 



105 



110 



Gly Leu Lys Thr Ala Glu Glu Glu Ala Tyr Glu Leu Gly Lys Leu lie 
115 120 125 



Ala Arg Lys Thr Phe Glu Trp Asn Asp Lys Tyr Leu Gly Lys Glu Ala 

130 135 ' 140 

Thr Lys Lys Asp Ser Gin Thr lie Gly Asp Leu Leu Glu Lys Phe Ala 

145 " 150 155 " 160 



Glu Glu Tyr Phe Lys Thr His Lys Arg Thr Thr Lys .Ser Glu His Thr 
165 170 175 

Phe Phe Tyr Tyr Phe Ser Arg Thr Gin Arg Tyr Thr Asn 
180 185 

Leu Ala Thr Ala Glu Asn Leu lie Asn Ser He Glu Gin 
195 200 205 

Glu Trp Ala Arg Tyr Asn Ala Ala Arg Ala He Ser Ala Phe Cys He 
210 ' 215 ' 220 

Thr Phe Asn He Glu He Asp Leu Ser Gin Tyr Ser Lys Met Pro Asp 
225 230" * 235 240 

Arg Asn Ser Arg Asn He Pro Thr Asp Ala Glu He Leu Ser Gly He 
245 250 255 

Thr Lys Phe Glu Asp Tyr Leu Val Thr Arg Gly Asn Gin Val Asn Glu 
260 265 ~ 270 

Asp Val Lys Asp Ser Trp Gin Leu Trp Arg Trp Thr Tyr Gly Met Leu 

275 - ■ 280 - 285 

Ala Val Phe Gly Leu Arg Pro Arg Glu lie Phe He Asn Pro Asn He 
290 295 300 

Asp Trp Trp Leu Ser Lys Glu Asn He Asp Leu Thr 
305 310 315 

Lys Glu Cys Lys Thr Gly. ; Glu Arg Gin Ala Leu Pro 
325 330 

Trp He Asp Glu Phe Asp Leu Arg Asn Pro Lys Tyr Leu Glu Met Leu 
340 345 " 350 



Ser Lys Asp 
190 

He Asp Lys 



Trp Lys Val Asp 
320 

Leu His Lys Glu 
335 



Ala Thr Ala 
355 



He Ser Lys Lys 



Asp Lys 
360 



Thr Asn His Ala 
365 



Glu He Thr 



Ala Leu Thr 
370 



Gin Arg lie Ser Trp Trp 
375 



Phe Arg Lys Val 
380 



Glu Leu Asp 



Phe Lys Pro 
385 



Tyr Asp Leu. Arg His Ala 
390 



Trp Ala He Arg 
395 



Ala His lie 
400 



Leu Gly He Pro He Lys ': Ala Ala Ala 
405 



Asp Asn Leu Gly 
410 



His Ser Met 
415 



Gin Val His 



Thr Gin Thr Tyr 
420 



Gin Arg 
425 



Trp Phe Ser Leu 



Asp Met Arg 
430 



Lys Leu Ala 
435 



He Asn Gin Ala 



Leu Thr 
440 



Lys Arg Asn Glu 
4 4 5 



Phe Glu Val 



He Arg Glu 
450 



Glu Asn Ala Lys Leu Gin 
455 



He Glu Asn Glu 
460 



Arg Leu Arg 



/ 
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# 



Met Glu He Glu Lys Leu Lys Met Glu He Ala Tyr Lys Asn Ser 

465 470 475 



<210> 68 
<211> 1029 
<212> DNA 
10 <213> Artificial Sequence 

<220> 



15 



10 



40 



45 



50 



60 



65 



<223> Description of Artificial Sequence: DNA sequence 
coding for fusion protein NLS-Ssv 

<220> 

<221> CDS 

<222> (1) . . (1026) 

<400> 68 

atg ccc aag aag aag agg aag. gtg acg aaa gat aag acg cgt tat aaa 48 
Met Pro Lys Lys Lys Arg Lys Val Thr Lys Asp Lys Thr Arg Tyr Lys 
1 5 10 15 



5 tac ggg gat tat att tta cgc gag agg aaa ggg egg tat tat gtt tac 
Tyr Gly Asp Tyr He Leu Arg Glu Arg Lys Gly Arg Tyr Tyr Val Tyr 
20 25 " " 30 



,35 50 55 



60 



55 130 135 



140 



155 ' 160 



96 



aag eta gag tat gaa aac ggt gag gta aaa gag cgt tac gtg ggt cct 144 
Lys Leu Glu Tyr Glu Asn Gly Glu Val Lys Glu Arg Tyr Val Gly Pro 
35 4 0 45 

tta get gac gtc gtt gaa tea tat eta aaa atg aaa tta ggg gtc gta 192 
Leu Ala Asp Val Val Glu Ser Tyr Leu Lys Met Lys Leu Gly Val Val 



ggg gat act ccc eta caa gcg gat ccc ccc ggt ttc gag ccc ggg aca 240 
Gly Asp Thr Pro Leu Gin Ala Asp Pro Pro Gly Phe Glu Pro Gly Thr 
65 70 75 80 

age gga age ggt ggt gga aaa gag gga act gaa cga cgt aaa ata gcg 
Ser Gly Ser Gly Gly Gly Lys Glu Gly Thr Glu Arg Arg Lys He Ala 
85 90 . 95 

ttg gtt gee aat ttg cgc caa tac gcg acg gac ggc aac ata aag gcg 
Leu Val Ala Asn Leu Arg Gin Tyr Ala Thr Asp Gly Asn He Lvs Ala 
100 105 " HO 

ttc tac aac tat etc atg aac gaa agg ggg ata age gaa aaa act gca 
Phe Tyr Asn Tyr Leu Met Asn Glu Arg Gly lie Ser Glu Lys Thr Ala 
115 120 125 

aag gac tac ate aat get ata tea aag ccg tat aaa gag acg aga gac 
Lys Asp Tyr He Asn Ala He Ser Lys Pro Tyr Lys Glu Thr Arg Asp 



288 



336 



384 



432 



gca cag aag get tac cga etc ttt gca cgt ttc tta gcg tea cgc aat 480 
Ala Gin Lys Ala Tyr Arg Leu Phe Ala Arg Phe Leu Ala Ser Arq Asn 
145 150 



528 



ate ata cat gat gaa ttt gcg gat aaa ata ttg aaa gcg gta aag gtg 

He He His Asp Glu Phe Ala Asp Lys He Leu Lys Ala Val Lys Val 

165 170 175 

aag aag gcg aac get gat ate tac att cca acg ttg gaa gag ata aaa 576 

Lys Lys Ala Asn Ala Asp He Tyr He Pro Thr Leu Glu Glu He Lvs 

180 185 190 
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agg acg tta caa tta gca aaa gac tat age gaa aac gtc tac ttc ate 624 

Arg Thr Leu Gin Leu Ala Lys Asp Tyr Ser Glu Asn Val Tyr Phe lie 

195 ' 200 205 

5 

tac cgt ate get etc gag teg ggc gtt agg ctg age gaa ata ctg aaa 672 

Tyr Arg lie Ala Leu Glu Ser Gly Val Arg Leu Ser Glu lie Leu Lys 
210 215 220 

10 gtg ctg aag gaa ccc gaa agg gac att tgc ggt aac gac gtc tgt tat 720 
Val Leu Lys Glu Pro Glu Arg Asp lie Cys Gly Asn Asp Val Cys Tyr 
225 230 235 . 240 

tat ccg ctt agt tgg act agg gga tat aag ggc gtc ttc tat gta ttc 768 
15 Tyr Pro Leu Ser Trp Thr Arg Gly Tyr Lys Gly Val Phe Tyr Val Phe 

245 ' ' 250 " 255 

cac ata acg cct ctg aag aga gta gag gtg acg aag tgg gca ata gcg 816 
, - His lie Thr Pro Leu Lys Arg Val Glu Val Thr Lys Trp Ala lie Ala 

9 260 265 270 

i - 

t 

| gac ttt gaa cga cgt cat aag gac get ata gcg ata aag tac ttc cgc 864 

"Asp Phe Glu Arg Arg His Lys Asp Ala lie Ala lie Lys Tyr Phe Arg 
, 275 280 285 

■ aaa ttc gta gcg tct aag atg get gag eta age gta ccg tta gat att 912 
; Lys Phe Val Ala Ser Lys Met Ala Glu Leu Ser Val Pro Leu Asp lie 
j 290 295 300 

•0 ate gat ttt att caa ggg cgt aaa ccg aca cgc gtt tta acg caa cat 960 
lie Asp Phe lie Gin Gly Arg Lys Pro Thr Arg Val Leu Thr Gin His 
305 310 315 320 

I 

I 

tac gta teg etc ttc ggc ata gcg aaa gag caa tat aaa aag tat gcg 1008 
15 Tyr Val Ser Leu Phe Gly lie Ala Lys Glu Gin Tyr Lys Lys Tyr Ala 
| 325 330 " 335 

gaa tgg eta aaa ggg gtc tga 1029 
Glu Trp Leu Lys Gly Val 
40 340 

<210> 69 
<211> 342 
45 <212> PRT 

<213> Artificial Seguence 

<223> Description of Artificial Seguence: DNA seguence 
coding for fusion protein NLS-Ssv 

50 <400> 69 

Met Pro Lys Lys Lys Arg Lys Val Thr Lys Asp Lys Thr Arg Tyr Lys 
15 10 15 

Tyr Gly Asp Tyr lie Leu Arg Glu Arg Lys Gly Arg Tyr Tyr Val Tyr 
55 20 25 " 30 

Lys Leu Glu Tyr Glu Asn Gly Glu Val Lys Glu Arg Tyr Val Gly Pro 
35 40 ' 45 

60 Leu Ala Asp Val Val Glu Ser Tyr Leu Lys Met Lys Leu Gly Val Val 
50 55 60 

Gly Asp Thr Pro Leu Gin Ala Asp Pro Pro Gly Phe Glu Pro Gly Thr 
65 70 75 80 

Ser Gly Ser Gly Gly Gly Lys Glu Gly Thr Glu Arg Arg Lys lie Ala 
85 90 95 



65 



99 



Leu Val Ala Asn Leu Arg Gin Tyr Ala 

100 105 

Phe Tyr Asn Tyr Leu Met Asn Glu Arg 
115 120 

Lys Asp Tyr lie Asn Ala lie Ser Lys 
130 135 

Ala Gin Lys Ala Tyr Arg Leu Phe Ala 
145 150 

lie lie His Asp Glu Phe Ala Asp Lys 
165 

Lys Lys Ala Asn Ala Asp lie Tyr lie 
180 185 

Arg Thr Leu Gin Leu, Ala Lys Asp Tyr 
195 200 

Tyr Arg He Ala Leu Glu Ser Gly Val 
210 215 

Val Leu Lys Glu Pro Glu Arg Asp He 

225 230 

Tyr Pro Leu Ser Trp Thr Arg Gly Tyr 
245 

His He Thr Pro Leu Lys Arg Val Glu 
260 265 

Asp Phe Glu Arg Arg His Lys Asp Ala 
275 280 

Lys Phe Val Ala Ser Lys Met Ala Glu 
290 295 

He Asp Phe He Gin Gly Arg Lys Pro 
305 310 

Tyr Val Ser Leu Phe Gly He Ala Lys 
325 

Glu Trp Leu Lys Gly Val 
34 0 



Thr Asp Gly Asn He Lys Ala 

110 

Gly He Ser Glu Lys Thr Ala 
125 

Pro Tyr Lys Glu Thr Arg Asp 
140 

Arg Phe Leu Ala Ser Arg Asn 
155 . 160 

lie Leu Lys Ala Val Lys Val 
170 175 

Pro Thr Leu Glu Glu He Lys 
190 

Ser Glu Asn Val Tyr Phe He 
205 

Arg Leu Ser Glu lie Leu Lys 
220 

Cys Gly Asn Asp Val Cys Tyr 
235 ' 240 

Lys Gly Val Phe Tyr Val Phe 
250 255 

Val Thr Lys Trp Ala He Ala 
270 

He Ala lie Lys Tyr Phe Arg 
285 

Leu Ser Val Pro Leu Asp lie 
300 

Thr Arg Val Leu Thr Gin His 
315 320 

Glu Gin Tyr Lys Lys Tyr Ala 
330 335 



<210> 70 
<211> 3908 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pBS-SSV3 



<400> 70 

cacctaaatt gtaagcgtta atattttgtt 
ctcatttttt aaccaatagg ccgaaatcgg 
cgagataggg ttgagtgttg ttccagtttg 
ctccaacgtc aaagggcgaa aaaccgtcta 
accctaatca agttttttgg ggtcgaggtg 
gagcccccga tttagagctt gacggggaaa 



aaaattcgcg ttaaattttt gttaaatcag 60 
caaaatccct tataaatcaa aagaatagac 120 
gaacaagagt ccactattaa agaacgtgga 180 
tcagggcgat ggcccactac gtgaaccatc 24 0 
ccgtaaagca ctaaatcgga accctaaagg 300 
gccggcgaac gtggcgagaa aggaagggaa 360 



100 



gaaagcgaaa 
caccacaccc 
gcgcaactgt 
agggggatgt 
ttgtaaaacg 
ccgcggtggc 
attttacgcg 
gtaaaagagc 
ttaggggtcg 
agcggaagcg 
ttgcgccaat 

agggggataa 

gagacgagag 
atcatacatg 
gctgatatct 
tatagcgaaa 
gaaatactga 
tatccgctta 
ctgaagagag 
gctatagcga 
ccgttagata 
tacgtatcgc 
ggggtctgac 
ttcgagcttg 
aattccacac 
gagctaactc 
gtgccagctg 
ctcttccgct 
atcagctcac 
gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatgaga 
taaatcaatc 
tgaggcacct 
cgtgtagata 
gcgagaccca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgc 



ggagcgggcg 
gccgcgctta 
tgggaagggc 
gctgcaaggc 
acggccagtg 
ggccgcccga 
agaggaaagg 
gttacgtggg 
taggggatac 
gtggtggaaa 
acgcgacgga 
gcgaaaaaac 
acgcacagaa 
atgaatttgc 
acattccaac 
acgtctactt 
aagtgctgaa 
gttggactag 
tagaggtgac 
taaagtactt 
ttatcgattt 
tcttcggcat 
tcgagggggg 
gcgtaatcat 
aacatacgag 
acattaattg 
cattaatgaa 
tcctcgctca 
tcaaaggcgg 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcata 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 



ctagggcgct 
atgcgccgct 
gatcggtgcg 
gattaagttg 
aattgtaata 
tatgacgaaa 
gcggtattat 
tcctttagct 
tcccctacaa 
agagggaact 
cggcaacata 
tgcaaaggac 
ggcttaccga 
ggataaaata 
gttggaagag 
catctaccgt 
ggaacccgaa 
gggatataag 
gaagtgggca 
ccgcaaattc 
tattcaaggg 
agcgaaagag 
gcccggtacc 
ggtcatagct 
ccggaagcat 
cgttgcgctc 
tcggccaacg 
ctgactcgct 
taatacggtt 
agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 
ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



ggcaagtgta 
acagggcgcg 
ggcctcttcg 
ggtaacgcca 
cgactcacta 
gataagacgc 
gtttacaagc 
gacgtcgttg 
gcggatcccc 
gaacgacgta 
aaggcgttct 
tacatcaatg 
ctctttgcac 
ttgaaagcgg 
ataaaaagga 
atcgctctcg 
agggacattt 
ggcgtcttct 
atagcggact 
gtagcgtcta 
cgtaaaccga 
caatataaaa 
cagcttttgt 
gtttcctgtg 
aaagtgtaaa 
actgcccgct 
cgcggggaga 
gcgctcggtc 
atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



gcggtcacgc 
tcccattcgc 
ctattacgcc 
gggttttccc 
tagggcgaat 
gttataaata 
tagagtatga 
aatcatatct 
ccggtttcga 
aaatagcgtt 
acaactatct 
ctatatcaaa 
gtttcttagc 
taaaggtgaa 
cgttacaatt 
agtcgggcgt 
gcggtaacga 
atgtattcca 
ttgaacgacg 
agatggctga 
cacgcgtttt 
agtatgcgga 
tccctttagt 
tgaaattgtt 
gcctggggtg 
ttccagtcgg 
ggcggtttgc 
gttcggctgc 
tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat ( 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 
tccagttcga 
agcgtttctg 
acacggaaat 
ggttattgtc 
gttccgcgca 



tgcgcgtaac 
cattcaggct 
agctggcgaa 
agtcacgacg 
tggagctcca 
cggggattat 
aaacggtgag 
aaaaatgaaa 
gcccgggaca 
ggttgccaat 
catgaacgaa 
gccgtataaa 
gtcacgcaat 
gaaggcgaac 
agcaaaagac 
taggctgagc 
cgtctgttat 
cataacgcct 
tcataaggac 
gctaagcgta 
aacgcaacat 
atggctaaaa 
gagggttaat 
atccgctcac 
cctaatgagt 
gaaacctgtc 
gtattgggcg 
ggcgagcggt 
acgcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgtcaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
ca'tttccccg 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114.0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 

2940 ;' : 

3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 . 
3480 : 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3908 



<210> 71 
<211> 3927 
<212> DNA 

<213> Artificial Sequence 



101 



# 



<220> 

<223> Description of Artificial Sequence: 
pBS-SSV4 



vector 



10 



15 



>0 



15 



40 



45 



50 



55 



60 



65 



<400> 71 

cacctaaatt 

ctcatttttt 

cgagataggg 

ctccaacgtc 

accctaatca 

gagcccccga 

gaaagcgaaa 

caccacaccc 

gcgcaactgt 

agggggatgt 

ttgtaaaacg 

ccgcggtggc 

ttataaatac 

agagtatgaa 

atcatatcta 

cggtttcgag 

aatagcgttg 

caactatctc 

tatatcaaag 

tttcttagcg 

aaaggtgaag 

gttaeaatta 

gtcgggcgtt 

cggtaacgac 

tgtattccac 

tgaacgacgt 

gatggctgag 

aegcgtttta 

gtatgc'ggaa'. 

ccctttagtg 

gaaattgtta 

cctggggtgc 

tccagtcggg 

gcggtttgcg 

ttcggctgcg 

caggggataa 

aaaaggccgc 

atcgacgctc 

cccctggaag 

ccgcctttct 

gttcggtgta 

accgctgcgc 

cgccactggc 

cagagttctt 

gcgctctgct 

aaaccaccgc 

aaggatctca 

actcacgtta 

taaattaaaa 

gttaccaatg 

tagttgcctg 

ccagtgctgc 

accagccagc 

agtctattaa 

acgttgttgc 

tcagctccgg 

cggttagctc 

tcatggttat 

ctgtgactgg 

gctcttgccc 

tcatcattgg 

ccagttcgat 



gtaagcgtta 

aaccaatagg 

ttgagtgttg 

aaagggcgaa 

agttttttgg 

tttagagctt 

ggagcgggcg 

gccgcgctta 

tgggaagggc 

gctgcaaggc 

acggccagtg 

ggccgcacca 

ggggattata 

aacggtgagg 

aaaatgaaat 

cccgggacaa 

gttgccaatt 

atgaaegaaa 

ccgtataaag 

tcacgcaata 

aaggcgaacg 

gcaaaagact 

aggctgagcg 

gtctgttatt 

ataacgcctc 

cataaggacg 

ctaagcgtac 

acgcaacatt 

tggctaaaag' 

agggttaatt 

tccgctcaca 

ctaatgagtg 

aaacctgtcg 

tattgggcgc 

gcgagcggta 

cgcaggaaag 

gttgctggcg 

aagtcagagg 

ctccctcgtg 

cccttcggga 

ggtcgttcgc 

cttatccggt 

agcagccact 

gaagtggtgg 

gaagccagtt 

tggtagcggt 

agaagatcct 

agggattttg 

atgaagtttt 

cttaatcagt 

actccccgtc 

aatgataccg 

cggaagggcc 

ttgttgccgg 

cattgctaca 

ttcccaacga 

cttcggtcct 

ggcagcactg 

tgagtactca 

ggcgtcaata 

aaaacgttct 

gtaacccact 



atattttgtt 
ccgaaatcgg 
ttccagtttg 
aaaccgtcta 
ggtcgaggtg 
gacggggaaa 
ctagggcgct 
atgcgccgct 
gatcggtgcg 
gattaagttg 
aattgtaata 
tgcccaagaa 
ttttacgcga 
taaaagagcg 
taggggtcgt 
gcggaagcgg 
tgcgccaata 
gggggataag 
agacgagaga 
tcatacatga 
ctgatatcta 
atagcgaaaa 
aaatactgaa 
atccgcttag 
tgaagagagt 
ctatagcgat 
cgttagatat 
acgtatcgct 
gggtctgact 
tcgagcttgg 
attccacaca 
agctaactca 
tgccagctgc 
tcttccgctt 
tcagctcact 
aacatgtgag 
tttttccata 
tggcgaaacc 
cgctctcctg 
agcgtggcgc 
tccaagctgg 
aactatcgtc 
ggtaacagga 
cctaactacg 
accttcggaa 
ggtttttttg 
ttgatctttt 
gtcatgagat 
aaatcaatct 
gaggcaccta 
gtgtagataa 
cgagacccac 
gagcgcagaa 
gaagctagag 
ggcatcgtgg 
tcaaggcgag 
ccgatcgttg 
cataattctc 
accaagtcat 
cgggataata 
tcggggcgaa 
cgtgcaccca 



aaaattcgcg 

caaaatccct 

gaacaagagt 

tcagggcgat 

ccgtaaagca 

gccggcgaac 

ggcaagtgta 

acagggcgcg 

ggcctcttcg 

ggtaacgcca 

cgactcacta 

gaagaggaag- 

gaggaaaggg 

ttacgtgggt 

aggggatact 

tggtggaaaa 

cgcgacggac 

cgaaaaaact 

cgcacagaag 

tgaatttgcg 

cattccaacg 

cgtctacttc 

agtgctgaag 

ttggactagg 

agaggtgacg 

aaagtacttc 

tatcgatttt 

cttcggcata 

cgaggggggg 

cgtaatcatg 

acatacgagc 

cattaattgc 

attaatgaat 

cctcgctcac 

caaaggcggt 

caaaaggcca 

ggctccgccc 

cgacaggact 

ttccgaccct 

tttctcatag 

gctgtgtgca 

ttgagtccaa 

ttagcagagc 

gctacactag 

aaagagttgg 

tttgcaagca 

ctacggggtc 

tatcaaaaag 

aaagtatata 

tctcagcgat 

ctacgatacg 

gctcaccggc 

gtggtcctgc 

taagtagttc 

tgtcacgctc 

ttacatgatc 

tcagaagtaa 

ttactgtcat 

tctgagaata 

ccgcgccaca 

aactctcaag 

actgatcttc 



ttaaattttt 
tataaatcaa 
ccactattaa 
ggcccactac 
ctaaatcgga 
gtggcgagaa 
gcggtcacgc 
tcccattcgc 
ctattacgcc 
gggttttccc 
tagggcgaat 
gtgacgaaag 
cggtattatg 
cctttagctg 
cccctacaag 
gagggaactg 
ggcaacataa 
gcaaaggact 
gcttaccgac 
gataaaatat 
ttggaagaga 
atctaccgta 
gaacccgaaa 
ggatataagg 
aagtgggcaa 
cgcaaattcg 
attcaagggc 
gcgaaagagc 
cccggtaccc 
gtcatagctg 
cggaagcata 
gttgcgctca 
cggccaacgc 
tgactcgctg 
aatacggtta 
gcaaaaggcc 
ccctgacgag 
ataaagatac 
gccgcttacc 
ctcacgctgt 
cgaacccccc 
cccggtaaga 
gaggtatgta 
aaggacagta 
tagctcttga 
gcagattacg 
tgacgctcag 
gatcttcacc 
tgagtaaact 
ctgtctattt 
ggagggctta 
tccagattta 
aactttatcc 
gccagttaat 
gtcgtttggt 
ccccatgttg 
gttggccgca 
gccatccgta 
gtgtatgcgg 
tagcagaact 
gatcttaccg 
agcatctttt 



gttaaatcag 

aagaatagac 

agaacgtgga 

gtgaaccatc 

accctaaagg 

aggaagggaa 

tgcgcgtaac 

cattcaggct 

agctggcgaa 

agtcacgacg 

tggagctcca 

ataagacgcg 

tttacaagct 

acgtcgttga 

cggatccccc 

aacgacgtaa 

aggcgttcta 

acatcaatgc. 

tctttgcacg 

tgaaagcggt 

taaaaaggac 

tcgctctcga 

gggacatttg 

gcgtcttcta 

tagcggactt 

tagcgtctaa 

gtaaaccgac 

aatataaaaa 

agcttttgtt 

tttcctgtgt 

aagtgtaaag 

ctgcccgctt 

gcggggagag 

cgctcggtcg 

tccacagaat 

aggaaccgta 

catcacaaaa 

caggcgtttc 

ggatacctgt 

aggtatctca 

gttcagcccg 

cacgacttat 

ggcggtgcta 

tttggtatct 

tccggcaaac 

cgcagaaaaa 

tggaacgaaa 

tagatccttt 

tggtctgaca 

cgttcatcca 

ccatctggcc 

tcagcaataa 

gcc ; tccatcc 

agtttgcgca 

atggcttcat 

tgcaaaaaag 

gtgttatcac 

agatgctttt 

cgaccgagtt 

ttaaaagtgc 

ctgttgagat 

actttcacca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 



i it" i- ' '■■ >' >| ;i il " 



102 



gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 3780 

cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 3840 

gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 3900 

ttccgcgcac atttccccga aaagtgc 3927 



10 



15 



<210> 72 
<211> 3351 
<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> Description of Artificial Sequence: vector 
pBS-SSVs 



50 



15 



40 



45 



50 



55 



60 



65 



<400> 72 

cgacctcgag 

tgccgaagag 

aatcgataat 

agtactttat 

tcacctctac 

tagtccaact 

tcagcacttt 

gcttttgttc 

ttcctgtgtg 

agtgtaaagc 

tgcccgcttt 

cggggagagg 

gctcggtcgt 

ccacagaatc 

ggaaccgtaa 

atcacaaaaa 

aggcgtttcc 

gatacctgtc 

ggtatctcag 

ttcagcccga 

acgacttatc 

gcggtgctac 

ttggtatctg 

ccggcaaaca 

gcagaaaaaa 

ggaacgaaaa 

agatcctttt 

ggtctgacag 

gttcatccat 

catctggccc 

cagcaataaa 

cctccatcca 

gtttgcgcaa 

tggcttcatt 

gcaaaaaagc 

tgttatcact 

gatgcttttc 

gaccgagttg 

taaaagtgct 

tgttgagatc 

ctttcaccag 

taagggcgac 

tttatcaggg 

aaataggggt 

tttgttaaaa 

aatcggcaaa 

agtttggaac 

cgtctatcag 

gaggtgccgt 

gggaaagccg 

ggcgctggca 



tcagacccct 
cgatacgtaa 
atctaacggt 
cgctatagcg 
tctcttcaga 
aagcggataa 
cagtatttcg 
cctttagtga 
aaattgttat 
ctggggtgcc 
ccagtcggga 
cggtttgcgt 
tcggctgcgg 
aggggataac 
aaaggccgcg 
tcgacgctca 
ccctggaagc 
cgcctttctc 
ttcggtgtag 
ccgctgcgcc 
gccactggca 
agagttcttg 
cgctctgctg 
aaccaccgct 
aggatctcaa 
ctcacgttaa 
aaattaaaaa 
ttaccaatgc 
agttgcctga 
cagtgctgca 
ccagccagcc 
gtctattaat 
cgttgttgcc 
cagctccggt 
ggttagctcc 
catggttatg 
tgtgactggt 
ctcttgcccg 
catcattgga 
cagttcgatg 
cgtttctggg 
acggaaatgt 
ttattgtctc 
tccgcgcaca 
ttcgcgttaa 
atcccttata 
aagagtccac 
ggcgatggcc 
aaagcactaa 
gcgaacgtgg 
agtgtagcgg 



tttagccatt 
tgttgcgtta 
acgcttagct 
tccttatgac 
ggcgttatgt 
taacagacgt 
ctcagcctaa 
gggttaattt 
ccgctcacaa 
taatgagtga 
aacctgtcgt 
attgggcgct 
cgagcggtat 
gcaggaaaga 
ttgctggcgt 
agtcagaggt 
tccctcgtgc 
ccttcgggaa 
gtcgttcgct 
ttatccggta 
gcagccactg 
aagtggtggc 
aagccagtta 
ggtagcggtg 
gaagatcctt 
gggattttgg 
tgaagtttta 
ttaatcagtg 
ctccccgtcg 
atgataccgc 
ggaagggccg 
tgttgccggg 
attgctacag 
tcccaacgat 
ttcggtcctc 
gcagcactgc 
gagtactcaa 
gcgtcaatac 
aaacgttctt 
taacccactc 
tgagcaaaaa 
tgaatactca 
atgagcggat 
tttccccgaa 
atttttgtta 
aatcaaaaga 
tattaaagaa 
cactacgtga 
atcggaaccc 
cgagaaagga 
tcacgctgcg 



ccgcatactt 
aaacgcgtgt 
cagccatctt 
gtcgttcaaa 
ggaatacata 
cgttaccgca 
cgcccgactc 
cgagcttggc 
ttccacacaa 
gctaactcac 
gccagctgca 
cttccgcttc 
cagctcactc 
acatgtgagc 
ttttccatag 
ggcgaaaccc 
gctctcctgt 
gcgtggcgct 
ccaagctggg 
actatcgtct 
gtaacaggat 
ctaactacgg 
ccttcggaaa 
gtttttttgt 
tgatcttttc 
tcatgagatt 
aatcaatcta 
aggcacctat 
tgtagataac 
gagacccacg 
agcgcagaag 
aagctagagt 
gcatcgtggt 
caaggcgagt 
cgatcgttgt 
ataattctct 
ccaagtcatt 
gggataatac 
cggggcgaaa 
gtgcacccaa 
caggaaggca 
tactcttcct 
acatatttga 
aagtgccacc 
aatcagctca 
atagaccgag 
cgtggactcc 
accatcaccc 
taaagggagc 
agggaagaaa 
cgtaaccacc 



tttatattgc 
cggtttacgc 
agacgctacg 
gtccgctatt 
gaagacgccc 
aatgtccctt 
gagggggggc 
gtaatcatgg 
catacgagcc 
attaattgcg 
ttaatgaatc 
ctcgctcact 
aaaggcggta 
aaaaggccag 
gctccgcccc 
gacaggacta 
tccgaccctg 
ttctcatagc 
ctgtgtgcac 
tgagtccaac 
tagcagagcg 
ctacactaga 
aagagttggt 
ttgcaagcag 
tacggggtct 
atcaaaaagg 
aagtatatat 
ctcagcgatc 
tacgatacgg 
ctcaccggct 
tggtcctgca 
aagtagttcg 
gtcacgctcg 
tacatgatcc 
cagaagtaag 
tactgtcatg 
ctgagaatag 
cgcgccacat 
actctcaagg 
ctgatcttca 
aaatgccgca 
ttttcaatat 
atgtatttag 
taaattgtaa 
ttttttaacc 
atagggttga 
aacgtcaaag 
taatcaagtt 
ccccgattta 
gcgaaaggag 
acacccgccg 



tctttcgcta 
ccttgaataa 
aatttgcgga 
gcccacttcg 
ttatatcccc 
tcgggttcct 
ccggtaccca 
tcatagctgt 
ggaagcataa 
ttgcgctcac 
ggccaacgcg 
gactcgctgc 
atacggttat 
caaaaggcca 
cctgacgagc 
taaagatacc 
ccgcttaccg 
tcacgctgta 
gaaccccccg 
ccggtaagac 
aggtatgtag 
aggacagtat 
agctcttgat 
cagattacgc 
gacgctcagt 
atcttcacct 
gagtaaactt 
tgtctatttc 
gagggcttac 
ccagatttat 
actttatccg 
ccagttaata 
tcgtttggta 
cccatgttgt 
ttggccgcag 
ccatccgtaa 
tgtatgcggc 
agcagaactt 
atcttaccgc 
gcatctttta 
aaaaagggaa 
tat.tgaagca 
aaaaataaac 
gcgttaatat 
aataggccga 
gtgttgttcc 
ggcgaaaaac 
ttttggggtc 
gagcttgacg 
cgggcgctag 
cgcttaatgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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1 



gcegctacag ggcgcgtccc attcgccatt caggctgcgc aactgttggg aagggcgatc 3120 
ggtgcgggcc tcttcgcta't tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt 3180 
aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt 3240 
gtaatacgac tcactatagg gcgaattgga gctccaccgc ggtggcggcc gctctagaac 3300 
5 tagtggatcc cccgggctgc aggaattcga tatcaagctt atcgataccg t 3351 

<210> 73 
<211> 5730 
10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
15 pCMVC31 (NNLS) 

<400> 73 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 
i tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 

0 cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 
j atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 
\ tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 
j cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 
5 cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 
j ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 54 0 

aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 

1 gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 
i attaatacga ctcactatag ggagacecaa gctgactcta gacttaatta agcgttgggg 720 

0 tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 

1 gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 

j cagggcggcc gcaccatgcc caagaagaag aggaaggtga • cacaaggggt tgtgaccggg 1020 
i5 gtggacacgt acgcgggtgc ttacgaccgt cagtcgcgcg agcgcgagaa ttcgagcgca 1080 
i gcaagcccag cgacacagcg tagcgccaac gaagacaagg cggccgacct tcagcgcgaa 1140 
gtcgagcgcg acgggggccg gttcaggttc gtcgggcatt tcagcgaagc gccgggcacg 1200 
; tcggcgttcg ggacggcgga gcgcccggag ttcgaacgca tcctgaacga atgccgcgcc 1260 
! gggcggctca acatgatcat tgtctatgac gtgtcgcgct tctcgcgcct gaaggtcatg 1320 

40 gacgcgattc cgattgtetc ggaattgctc gccctgggcg tgacgattgt ttccactcag 1380 
gaaggcgtct tccggcaggg aaacgtcatg gacctgattc acctgattat gcggctcgac 1440 
gcgtcgcaca aagaatcttc gctgaagtcg gcgaagattc .tcgacacgaa gaaccttcag 1500 
cgcgaattgg gcgggtacgt cggcgggaag gcgccttacg gcttcgagct tgtttcggag 1560 
acgaaggaga tcacgcgcaa cggccgaatg gtcaatgtcg tcatcaacaa gcttgcgcac 1620 

45 tcgaccactc cccttaccgg acccttcgag ttcgagcccg acgtaatccg gtggtggtgg 1680 
cgtgagatca agacgcacaa acaccttccc ttcaagccgg ■ gcagtcaagc cgccattcac 1740 
ccgggcagca tcacggggct ttgtaagcgc atggacgctg acgccgtgcc gacccggggc 1800 
gagacgattg ggaagaagac cgcttcaagc gcctgggacc cggcaaccgt tatgcgaatc 1860 
cttcgggacc cgcgtattgc gggcttcgcc gctgaggtga tctacaagaa gaagccggac 1920 

50 ggcacgccga ccacgaagat tgagggttac cgcattcagc gcgacccgat cacgctccgg 1980 
ccggtcgagc ttgattgcgg accgatcatc gagcccgctg agtggtatga gcttcaggcg 204 0 
tggttggacg gcagggggcg cggcaagggg ctttcccggg ggcaagccat tctgtccgcc 2100 
atggacaagc tgtactgcga gtgtggcgcc gtcatgactt cgaagcgcgg ggaagaatcg 2160 
atcaaggact cttaccgctg ccgtcgccgg aaggtggtcg acccgtccgc acctgggcag 2220 

55 cacgaaggca cgtgcaacgt cagcatggcg gcactcgaca agttcgttgc ggaacgcatc 228 0 
ttcaacaaga tcaggcacgc cgaaggcgac gaagagacgt tggcgcttct gtgggaagcc 234 0 
gcccgacgct tcggcaagct cactgaggcg cctgagaaga gcggcgaacg ggcgaacctt 2400 
gttgcggagc gcgccgacgc cctgaacgcc cttgaagagc tgtacgaaga ccgcgcggca 24 60 
ggcgcgtacg acggacccgt tggcaggaag cacttccgga agcaacaggc agcgctgacg 2520 

60 ctccggcagc aaggggcgga agagcggctt gccgaacttg aagccgccga agccccgaag 258 0 
cttccccttg accaatggtt ccccgaagac gccgacgctg acccgaccgg ccctaagtcg 2640 
tggtgggggc gcgcgtcagt agacgacaag cgcgtgttcg tcgggctctt cgtagacaag 2700 
atcgttgtca cgaagtcgac tacgggcagg gggcagggaa cgcccatcga gaagcgcgct 2760 
tcgatcacgt gggcgaagcc gccgaccgac gacgacgaag acgacgccca ggacggcacg 2820 

65 . gaagacgtag cggcgtaggc ggcgcccggg ctcgagatcc aggcgcggat caataaaaga 2880 
tcattatttt caatagatct gtgtgttggt tttttgtgtg ccttggggga gggggaggcc 294 0 
agaatgaggc gcggccaagg gggaggggga ggccagaatg accttggggg agggggaggc 3000 



104 



10 



15 



!5 



30 



35 



40 



45 



cagaatgacc 
aattcactgg 
aategccttg 
gatcgccctt 
ctccttacgc 
tctgatgccg 
cgggcttgtc 
atgtgtcaga 
cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgctc 
cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 
gcccgggcta 



ttgggggagg 
ccgtcgtttt 
cagcacatcc 
cccaacagtt 
atctgtgcgg 
catagttaag 
tgctcccggc 
ggttttcacc 
tataggttaa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttecaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 
gcttgcatgc 



gggaggccag 
acaacgtcgt 
ccctttcgcc 
gcgcagcctg 
tatttcacac 
ccagccccga 
atccgcttac 
gtcatcaccg 
tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 
tatggatgaa 
actgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgata^agtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattctgtg 
aacgaccgag 
gcctctcccc 
gaaagcgggc 
ggctttacac 
tca'cacagga 
ctgcaggttt 



aatgaggcgc 
gactgggaaa 
agctggcgta 
aatggcgaat 
cgcatatggt 
cacccgccaa 
agacaagctg 
aaacgcgcga 
ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 



gcccccgggt 
accctggcgt 
atagcgaaga 
ggcgcctgat 
gcactctcag 
cacccgctga 
tgaccgtctc 
gacgaaaggg 
cttagacgtc 
tctaaataca 
aatattgaaa 
ttgcggcatt 
ctgaagatca 
tccttgagag 
tatgtggcgc 
actattctca 
gcatgacagt 
acttaettct 
gggatcatgt 
acgagcgtga 
gcgaactact 
ttgcaggacc 
gagccggtga 
cccgtatcgt 
agatcgctga 
catatatact 
tcctttttga 
cagaccccgt 
gctgcttgca 
taccaactct 
ttctagtgta 
tcgctctgct 
ggttggactc 
cgtgcacaca 
agctatgaga 
gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
ttaccgcctt 
cagtgagcga 
cgattcatta 
acgcaattaa 
cggctcgtat 
aceatgatta 



accgagctcg 
tacccaactt 
ggcccgcacc 
gcggtatttt 
tacaatctgc 
cgcgccctga 
cgggagctgc 
cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa 
atgcagctgg 
tgtgagttag 
gttgtgtgga 
cgccaagcta 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
•5460 
5520 
5580 
5640 
5700 
5730 



<210> 74 
50 <211> 4886 
<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Description of Artificial Sequence: vector 
pCMV-SSV 



60 



65 



<400> 74 

aaacagtccg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 



atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 



cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 



cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 



gattattgac 
tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 



tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggaetttcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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40 



45 



50 



55 



60 



65 



aaatgtcgta 
gtctatataa 
attaatacga 
tgagtactcc 
cgaggaggat 
ctggtcagaa 
gccatacact 
cagggcggcc 
tacgcgagag 
aagagcgtta 
gggtcgtagg 
gaagcggtgg 
gccaatacgc 
ggataagcga 
cgagagacgc 
tacatgatga 
atatctacat 
gcgaaaacgt 
tactgaaagt 
cgcttagttg 
agagagtaga 
tagcgataaa 
tagatattat 
tatcgctctt 
tctgactcga 
tattttcaat 
tgaggcgcgg 
atgaccttgg 
cactggccgt 
gccttgcagc 
gcccttccca 
ttacgcatct 
atgccgcata 
cttgtctgct 
gtcagaggtt 
tatttttata 
ggggaaatgt 
cgctcatgag 
gtattcaaca 
ttgctcaccc 
tgggttacat 
aacgttttcc 
ttgacgccgg 
agtactcacc 
gtgctgccat 
gaccgaagga 
gttgggaacc 
tagcaatggc 
ggcaacaatt 
cccttccggc 
gtatcattgc 
cggggagtca 
tgattaagca 
aacttcattt 
aaatccctta 
gatcttcttg 
cgctaccagc 
ctggcttcag 
accacttcaa 
tggctgctgc 
cggataaggc 
gaacgaccta 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
ttcctgcgtt 



acaactccgc 
gcagagctct 
ctcactatag 
ctctcaaaag 
ttgatattca 
aagacaatct 
tgagtgacat 
gcccgatatg 
gaaagggcgg 
cgtgggtcct 
ggatactccc 
tggaaaagag 
gacggacggc 
aaaaactgca 
acagaaggct 
atttgcggat 
tccaacgttg 
ctacttcatc 
gctgaaggaa 
gactagggga 
ggtgacgaag 
gtacttccgc 
cgattttatt 
cggeatagcg 

gggggggccc 

agatctgtgt 
ccaaggggga 
gggaggggga 
cgttttacaa 
acatccccct 
acagttgcgc 
gtgcggtatt 
gttaagccag 
cccggcatcc 
ttcaccgtca 
ggttaatgtc 
gcgcggaacc 
acaataaccc 
tttcegtgtc 
agaaacgctg 
cgaactggat 
aatgatgagc 
gcaagagcaa 
agtcacagaa 
aaccatgagt 
gctaaccgct 
ggagctgaat 
aacaacgttg 
aatagactgg 
tggctggttt 
agcactgggg 
ggcaactatg 
ttggtaactg 
ttaatttaaa 
acgtgagttt 
agatcctttt 
ggtggtttgt 
cagagcgcag 
gaactctgta 
cagtggcgat 
gcagcggteg 
caccgaactg 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 
atcccctgat 



cccattgacg 
ctggctaact 
ggagacccaa 
cgggcatgac 
cctggcccgc 
ttttgttgtc 
tgacatccac 
acgaaagata 
tattatgttt 
ttagctgacg 
ctacaagcgg 
ggaactgaac 
aacataaagg 
aaggactaca 
taccgaetct 
aaaatattga 
gaagagataa 
taccgtatcg 
cccgaaaggg 
tataagggcg 
tgggcaatag 
aaattcgtag 
caagggcgta 
aaagagcaat 
gtcgacctcg 
gttggttttt 
gggggaggcc 
ggccagaatg 
cgtcgtgact 
ttcgccagct 
agcctgaatg 
tcacaccgca 
ccccgacaec 
gcttacagac 
tcaccgaaac 
atgataataa 
cctatttgtt 
tgataaatgc 
gcccttattc 
gtgaaagtaa 
ctcaacagcg 
acttttaaag 
ctcggtcgcc 
aagcatctta 
gataacactg 
tttttgcaca 
gaagccatac 
cgcaaactat 
atggaggcgg 
attgctgata 
ccagatggta 
gatgaacgaa 
tcagaccaag 
aggatctagg 
tcgttccact 
tttctgcgcg 
ttgccggatc 
ataccaaata 
gcaccgccta 
aagtcgtgtc 
ggctgaacgg 
agatacctac 
aggtatccgg 
aacgcctggt 
ttgtgatgct 
cggttcctgg 
tctgtggata 



caaatgggcg 

agagaaccca 

gctgactcta 

ttctgcgcta 

ggtgatgcct 

aagcttgagg 

tttgcctttc 

agacgcgtta 

acaagctaga 

tcgttgaatc 

atccccccgg 

gacgtaaaat 

cgttctacaa 

tcaatgctat 

ttgcacgttt 

aagcggtaaa 

aaaggacgtt 

ctctcgagtc 

acatttgcgg 

tcttctatgt 

cggactttga 

cgtctaagat 

aaccgacacg 

ataaaaagta 

agatccaggc 

tgtgtgcctt 

agaatgacct 

aggcgcgccc 

gggaaaaccc 

ggcgtaatag 

gcgaatggcg 

tatggtgcac 

cgccaacacc 

aagctgtgac 

gcgcgagacg 

tggtttctta 

tatttttcta 

ttcaataata 

ccttttttgc 

aagatgctga 

gtaagatcct 

ttctgctatg 

gcatacacta 

cggatggcat 

cggccaactt 

acatggggga 

caaacgacga 

taactggcga 

ataaagttgc 

aatctggagc 

agccctcccg 

atagacagat 

tttactcata 

tgaagatcct 

gagcgtcaga 

taatctgctg 

aagagctacc 

ctgtccttct 

catacctcgc 

ttaccgggtt 

ggggttcgtg 

agcgtgagct 

taagcggcag 

atctttatag 

cgtcaggggg 

ccttttgctg 

accgtattac 



gtaggcgtgt 
ctgcttactg 
gacttaatta 
agattgtcag 
ttgagggtgg 
tgtggcaggc 
tctccacagg 
taaatacggg 
gtatgaaaac 
atatctaaaa 
tttcgagccc 
agcgttggtt 
ctatctcatg 
atcaaagccg 
cttagcgtca 
ggtgaagaag 
acaattagca 
gggcgttagg 
taacgacgtc 
attccacata 
acgacgtcat 
ggctgagcta 
cgttttaacg 
tgcggaatgg 
gcggatcaat 
gggggagggg 
tgggggaggg 
ccgggtaccg 
tggcgttacc 
cgaagaggcc 
cctgatgcgg 
tctcagtaca 
cgctgacgcg 
cgtctccggg 
aaagggcctc 
gacgtcaggt 
aatacattca 
ttgaaaaagg 
ggcattttgc 
agatcagttg 
tgagagtttt 
tggcgcggta 
ttctcagaat 
gacagtaaga 
acttctgaca 
tcatgtaact 
gcgtgacacc 
actacttact 
aggaccactt 
cggtgagcgt 
tatcgtagtt 
cgctgagata 
tatactttag 
ttttgataat 
ccccgtagaa 
cttgcaaaca 
aactcttttt 
agtgtagccg 
tctgctaatc 
ggactcaaga 
cacacagccc 
atgagaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 
cgcctttgag 



acggtgggag 
gcttatcgaa 
agcgttgggg 
tttccaaaaa 
ccgcgtccat 
ttgagatctg 
tgtccactcc 
gattatattt 
ggtgaggtaa 
atgaaattag 
gggacaagcg 
gccaatttgc 
aacgaaaggg 
tataaagaga 
cgcaatatca 
gcgaacgctg 
aaagactata 
ctgagcgaaa 
tgttattatc 
acgcctctga 
aaggacgcta 
agcgtaccgt 
caacattacg 
ctaaaagggg 
aaaagatcat 
gaggccagaa 
ggaggccaga 
agctcgaatt 
caacttaatc 
cgcaccgatc 
tattttctcc 
atctgctctg 
ccctgacggg 
agctgcatgt 
gtgatacgcc 
ggcacttttc 
aatatgtatc 
aagagtatga 
cttcctgttt 
ggtgcacgag 
cgccccgaag 
ttatcccgta 
gacttggttg 
gaattatgca 
acgatcggag 
cgccttgatc 
acgatgcctg 
ctagcttccc 
ctgcgctcgg 
gggtctcgcg 
atctacacga 
ggtgcctcac 
attgatttaa 
ctcatgacca 
aagatcaaag 
aaaaaaccac 
ccgaaggtaa 
tagttaggcc 
ctgttaccag 
cgatagttac 
agcttggagc 
gccacgcttc 
ggagagcgca 
tttcgccacc 
tggaaaaacg 
cacatgttct 
tgagctgata 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 




ccgctcgccg cagccgaacg accgagcgca 
gcccaatacg caaaccgcct ctccccgcgc 

acaggtttcc cgactggaaa gcgggcagtg 
ctcattaggc accccaggct ttacacttta 
tgagcggata acaatttcac acaggaaaca 
gggctagctt gcatgcctgc aggttt 



106 




gcgagtcagt gagcgaggaa gcggaagagc 4620 
gttggccgat tcattaatgc agctggcacg 4680 
agcgcaacgc aattaatgtg agttagctca 4740 
tgcttccggc tcgtatgttg tgtggaattg 4800 
gctatgacca tgattacgcc aagctagccc 4860 

4886 
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<220> 

<223> Description of Artificial Sequence: vector 
pCMV-SSV(NNLS) 



<400> 75 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300- 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 
atrlttaftt aCaaCtC ^ cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 
alrttan 9 *? t***™*^ ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 
| gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 

OD cagggcggcc gcaccatgcc caagaagaag aggaaggtga cgaaagataa gacgcgttat 1020 
, aaatacgggg attatatttt acgcgagagg aaagggcggt attatgttta caagctagag 1080 

JSnf« aC9 ^gaggtaaa agagcgttac gtgggtcctt tagctgacgt cgttgaatca 1140 
tatctaaaaa tgaaattagg ggtcgtaggg gatactcccc tacaagcgga tccccccggt 1200 
ttcgagcccg ggacaagcgg aagcggtggt ggaaaagagg gaactgaacg acgtaaaata 12 60 
gcgttggttg ccaatttgcg ccaatacgcg acggacggca acataaaggc gtLtacaac 1320 
tca«SS£ ^^aagggg gataagcgaa aaaactgcaa aggactacat caatgctata 1380 
Itt^fZ ataaagagac gagagacgca cagaaggctt accgactctt tgcacgtttc 14 40 
ttagcgtcac gcaatatcat acatgatgaa tttgcggata aaatattgaa agcggtaaag 1500 

45 caatSacaa SIITF* tatctacatt ccaaogttgg .aagagataaa aaggacgtta isiS 
<K caattagcaa aagactatag cgaaaacgtc tacttcatct accgtatcgc tctcgagtcg 1620 

alTaTalct SKST* ac ^ aa f^ ^tgaaggaac ccgaaaggga catttgcgg? 1680 
KccacSS g ctta gttgg actaggggat ataagggcgt cttctatgta 1740 

ttccacataa cgcctctgaa gagagtagag gtgacgaagt gggcaatagc ggactttqaa 1800 
50 ^ gaCgtCata aggacgctat agcgataaag tacttccgca aa?tcgtagc gLtaagSg I8 60 
Itililtlar ^f^l ^attatc gattttattc aagggcgtaa Iccgaclcgc I 9 20 
?"" aaCgC aaca ttacgt atcgctcttc ggcatagcga aagagcaata taaaaagtat 1980 
laHtrJr taaaaggg f ctgactcgag ggggggcccg tcgacctcga gatccaggcg 2040 
cggatcaata aaagatcatt attttcaata gatctgtgtg ttggtttttt gtgtgccttq 2100 
ggggaggggg aggccagaat gaggcgcggc caagggggag ggggaggcca gaatgacctt 2160 
55 gggggagggg gaggccagaa tgaccttggg ggagggggag gccagaatga ggcgcgcccc 2220 
cgggtaccga gctcgaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 2280 
ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 2340 
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgLtggcgc 2400 
ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat atggtgcact 2460 
ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc 2520 
gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 2580 
gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgagacga 2640 
aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataa? ggt?tc?tag 2700 
acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctal 2760 
atacattcaa atatgtatcc gctcatgaga caataaccct gataaa?gct tcaataSa? 282^ 
tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttat?cc cttttSgcq lllo 
gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgc^gaa lllo 



• I io7 ; # 



gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt 30.00 
gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt 3060 
ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat 3120 
tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg 3180 
5 acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta 3240 
cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat 3300 
catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag 3360 
cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa 3420 
ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca 3480 

10 ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc 3540 
ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt 3600 
atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc 3660 
gctgagatag gtgcctcact gattaagcat tggtaactgt cagaceaagt ttactcatat 3720 
atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt 3780 

15 tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac 3840 
cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc 3900 
ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca 3960 
actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta 4020- 
1 gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct 4080 
0 ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg 4140 
j gactcaagac gatagttacc ggataaggcg cageggtcgg gctgaacggg gggttcgtgc 4200 
acacagccca gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta 4260 

! tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg 4320 
gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt 4380 

15 cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg 4440 

i cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg 4500 
ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc 4560 
gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg 4620 
agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt 4680 

50 cattaatgca gctggcacga- caggtttccc gactggaaag cgggcagtga gcgcaacgca 47 4 0 
attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct 4800 
cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat 4860 

: gattacgcca agctagcccg ggctagcttg catgcctgca ggttt - 4905 

35 

<210> 76 
<211> 5290 
; <212> DNA 

<213> Artificial Seguence 

40 

<220> 

<223> Description of Artificial Sequence: vector 
pCMVXisA '.: . " • 

45 <400> 76 

agtccgatgt acgggccaga tatacgcgtt gacattgatt attgactagt tattaatagt 60 
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 120 
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 180 
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggactatt 24 0 

50 tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta 300 
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg 360 
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 420 
tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 480 
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 540 

55 gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 600 
atataagcag agctctctgg ctaactagag aacccactgc ttactggctt atcgaaatta 660 
atacgactca ctatagggag acccaagctg actctagact taattaagcg ttggggtgag 720 
tactccctct caaaagcggg catgacttct gcgctaagat tgtcagtttc caaaaacgag 780 
gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggccgc gtccatctgg 840 

60 tcagaaaaga caatcttttt gttgtcaagc ttgaggtgtg gcaggcttga gatctggcca 900 
tacacttgag tgacattgac atccactttg cctttctctc cacaggtgtc cactcccagg 960 
gcggccgccc gatatgcaaa atcagggtca agacaaatat caacaagcct ttgcagactt 1020 
agagccactc tcatctaccg acggcagttt tctcggctca agtctgcaag cacagcagca 1080 
aagagaacac atgagaacaa aagtactaca agacctagac aaggtaaatc tgcgtttgaa 1140 

65 gtctgcaaag acgaaagtct cagttcgaga atctaacgga agtctgcaat tacgagcaac 1200 
gttaccaatt aaacctggag ataaggacac caacggtaca ggcagaaagc aatacaatct 1260 
cagcttgaat atccctgcaa acttggatgg actgaagacg gctgaggaag aagcttatga 1320 



attaggtaaa 
agccactaaa 
ttttaaaacc 
cacccaacga 
tgagcaaatc 
cataacattc 
gcgcaacatc 
agttaccaga 
gacatatgga 
tattgattgg 
taaaactggt 
aagaaatccg 
.tcatgctgaa 
agattttaaa 
accaatcaaa 
. tcagcgctgg 
gaatgaattt 
gaggatggaa 
gtcgacctcg 
gttggttttt 
gggggaggcc 
ggccagaatg 
cgtegtgact 
ttcgccagct 
agcctgaatg 
tcacaccgca 
ccccgacacc 
gcttacagac 
tcaccgaaac 
atgataataa 
cctatttgtt 
tgataaatgc 
gcccttattc 
gtgaaagtaa 
ctcaacagcg 
acttttaaag 
ctcggtcgcc 
aagcatctta 
gataacactg 
tttttgcaca 
gaagccatac 
cgcaaactat 
atggaggcgg 
attgctgata 
ccagatggta 
gatgaacgaa 
tcagaccaag 
aggatctagg 
tcgttccact 
tttctgcgcg 
ttgccggatc 
ataccaaata 
gcaccgccta 
aagtcgtgtc 
ggctgaacgg 
agatacctac 
aggtatccgg 
aacgcctggt 
ttgtgatgct 
cggttcctgg 
tctgtggata 
accgagcgca 
ctccccgcgc 
gcgggcagtg 
ttacacttta 
acaggaaaca 
aggtttaaac 



ttaatcgctc 
aaagattcac 
cataaacgca 
tataccaatt 
gataaagaat 
aatatagaaa 
cccacagatg 
ggaaatcaag 
atgttagcag 
tggttaagca 
gaaagacaag 
aaatatttag 
ataacagcct 
ccctatgatt 
gcggcggctg 
ttctcgctag 
gaggtgatta 
attgagaagt 
agatccaggc 
tgtgtgcctt 
agaatgacct 
aggcgcgccc 
gggaaaaccc 
ggcgtaatag 
gcgaatggcg 
tatggtgcac 
cgccaacacc 
aagctgtgac 
gcgcgagacg 
tggtttctta 
tatttttcta 
ttcaataata 
ccttttttgc 
aagatgctga 
gtaagatcct 
ttctgctatg 
gcatacacta 
cggatggcat 
cggccaactt 
acatggggga 
caaacgacga 
taactggcga 
ataaagttgc 
aatctggagc 
agccctcccg 
atagacagat 
tttactcata 
tgaagatcct 
gagcgtcaga 
taatctgctg 
aagagctacc 
ctgtccttct 
catacctcgc 
ttaccgggtt 
ggggttcgtg 
agcgtgagct 
taagcggcag 
atctttatag 
cgtcaggggg 
ccttttgctg 
accgtattac 
gcgagtcagt 
gttggccgat 
agcgcaacgc 
tgcttccggc 
gctatgacca 



ggaaaacctt 
aaacaatagg 
ccactaaaag 
ccaaagattt 
gggcgagata 
ttgatttgtc 
cagaaatact 
ttaatgaaga 
tttttggttt 
aagagaatat 
cattaccctt 
aaatgctggc 
taactcagcg 
tacgtcacgc 
ataatttggg 
atatgcggaa 
gggaggagaa 
taaagatgga 
gcggatcaat 
gggggagggg 
tgggggaggg 
ccgggtaccg 
tggcgttacc 
cgaagaggcc 
cctgatgcgg 
tctcagtaca 
cgctgacgcg 
cgtctccggg 
aaagggcctc 
gacgtcaggt 
aatacattca 
ttgaaaaagg 
ggcattttgc 
agatcagttg 
tgagagtttt 
tggcgcggta 
ttctcagaat 
gacagtaaga 
acttctgaca 
tcatgtaact 
gcgtgacacc 
actacttact 
aggaccactt 
cggtgagcgt 
tatcgtagtt 
cgctgagata 
tatactttag 
ttttgataat 
ccccgtagaa 
cttgcaaaca 
aactcttttt 
agtgtagccg 
tctgctaatc 
ggactcaaga 
cacacagccc 
atgagaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 
cgcctttgag 
gagcgaggaa 
tcattaatgc 
aattaatgtg 
tcgtatgttg 
tgattacgcc 



tgaatggaat 
tgatttacta 
cgaacatacc 
agcaacggcg 
taatgccgcc 
ccagtattcc 
atcaggaatt 
tgtaaaagat 
acgccccagg 
agacctcaca 
acataaagaa 
aacagcaatt 
tattagttgg 
ctgggcaatt 
gcatagtatg 
gttagcgatt 
tgctaaattg 
aatagcttat 
aaaagatcat 
gaggccagaa 
ggaggccaga 
agctcgaatt 
caacttaatc 
cgcaccgatc 
tattttctcc 
atctgctctg 
ccctgacggg 
agctgcatgt 
gtgatacgcc 
ggcacttttc 
aatatgtatc 
aagagtatga 
cttcctgttt 
ggtgcacgag 
cgccccgaag 
ttatcccgta 
gacttggttg 
gaattatgca 
acgatcggag 
cgccttgatc 
acgatgcctg 
ctagcttccc 
ctgcgctcgg 
gggtctcgcg 
atctacacga 
ggtgcctcac 
attgatttaa 
ctcatgacca 
aagatcaaag 
aaaaaaccac 
ccgaaggtaa 
tagttaggcc 
ctgttaccag 
cgatagttac 
agcttggagc 
gccacgcttc 
ggagagcgca 
tttcgccacc 
tggaaaaacg 
cacatgttct 
tgagctgata 
gcggaagagc 
agctggcacg 
agttagctca 
tgtggaattg 
aagctagccc 



gataaatatt 
gaaaaatttg 
tttttttact 
gaaaatctca 
agagccatat 
aaaatgcctg 
accaaatttg 
agctggcaac 
gaaattttta 
tggaaagtag 
tggattgatg 
agtaaaaaag 
tggtttcgga 
agagcgcata 
caggttcata 
aatcaggctt 
cagatagaaa 
aagaatagtt 
tattttcaat 
tgaggcgcgg 
atgaccttgg 
cactggccgt 
gccttgcagc 
gcccttccca 
ttacgcatct 
atgccgcata 
cttgtctgct 
gtcagaggtt 
tatttttata 
ggggaaatgt 
cgctcatgag 
gtattcaaca 
ttgctcaccc 
tgggttacat 
aacgttttcc 
ttgacgccgg 
agtactcacc 
gtgctgccat 
gaccgaagga 
gttgggaacc 
tagcaatggc 
ggcaacaatt 
cccttccggc 
gtatcattgc 

cggggagtca 

tgattaagca 
aacttcattt 
aaatccctta 
gatcttcttg 
cgctaccagc 
ctggcttcag 
accacttcaa 
tggctgctgc 
cggataaggc 
gaacgaccta 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
ttcctgcgtt 
ccgctcgccg 
gcccaatacg 
acaggtttcc 
ctcattaggc 
tgagcggata 
gggctagctt 



taggcaaaga 
cagaagagta 
atttttcccg 
tcaattcaat 
cagctttttg 
atcgcaattc 
aagactatct 
tttggcgctg 
ttaaccctaa 
acaaagaatg 
agtttgattt 
ataaaacaaa 
aagtcgaatt 
ttttaggcat 
cacaaaccta 
tgactaagag 
atgaaaggtt 
gagcggccgc 
agatctgtgt 
ccaaggggga 
gggaggggga 
cgttttacaa 
acatccccct 
acagttgcgc 
gtgcggtatt 
gttaagccag 
cccggcatcc 
ttcaccgtca 
ggttaatgtc 
gcgcggaacc 
acaataaccc 
tttccgtgtc 
agaaacgctg 
cgaactggat 
aatgatgagc 
gcaagagcaa 
agtcacagaa 
aaccatgagt 
gctaaccgct 
ggagctgaat 
aacaacgttg 
aatagactgg 
tggctggttt 
agcactgggg 
ggcaactatg 
ttggtaactg 
ttaatttaaa 
acgtgagttt 
agatcctttt 
ggtggtttgt 
cagagcgcag 
gaactctgta 
cagtggcgat 
gcagcggtcg 
caccgaactg 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 
atcccctgat 
cagccgaacg 
caaaccgcct 
cgactggaaa 
accccaggct 
acaatttcac 
gcatgcctgc 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5290 
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<210> 77 
<211> 5309 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector 
pCMVXisANNLS 
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<400> 77 

aaacagtccg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 

tgagtactcc 

cgaggaggat 

ctggtcagaa 

gccatacact 

cagggcggcc 

tatcaacaag 

tcaagtctgc 

gacaaggtaa 

ggaagtctgc 

acaggcagaa 

acggctgagg 

aatgataaat 

ctagaaaaat 

accttttttt 

gcggaaaatc 

gccagagcca 

tccaaaatgc 

attaccaaat 

gatagctggc 

agggaaattt 

acatggaaag 

gaatggattg 

attagtaaaa 

tggtggtttc 

atcagagcgc 

atgcaggttc 

attaatcagg 

ttgcagatag 

tataagaata 

cattattttc 

gaatgaggcg 

agaatgacct 

attcactggc 

atcgccttgc 

atcgcccttc 

tccttacgca 

ctgatgccgc 

gggcttgtct 

tgtgtcagag 

gcctattttt 

ttcggggaaa 

atccgctcat 

tgagtattca 



atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
ctctcaaaag 
ttgatattca 
aagacaatct 
tgagtgacat 
gcaccatgcc 
cctttgcaga 
aagcacagca 
atctgcgttt 
aattacgagc 
agcaatacaa 
aagaagctta 
atttaggcaa 
ttgcagaaga 
actatttttc 
tcatcaattc 
tatcagcttt 
ctgatcgcaa 
ttgaagacta 
aactttggcg 
ttattaaccc 
tagacaaaga 
atgagtttga 
aagataaaac 
ggaaagtcga 
atattttagg 
atacacaaac 
ctttgactaa 
aaaatgaaag 
gttgagcggc 
aatagatctg 
cggccaaggg 
tgggggaggg 
cgtcgtttta 
agcacatccc 
ccaacagttg 
tctgtgcggt 
atagttaagc 
gctcccggca 
gttttcaccg 
ataggttaat 
tgtgcgcgga 
gagacaataa 
acatttccgt 



cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
cgggcatgac 
cctggcccgc 
ttttgttgtc 
tgacatccac 
caagaagaag 
cttagagcca 
gcaaagagaa 
gaagtctgca 
aacgttacca 
tctcagcttg 
tgaattaggt 
agaagccact 
gtattttaaa 
ccgcacccaa 
aattgagcaa 
ttgcataaca 
ttcgcgcaac 
tctagttacc 
ctggacatat 
taatattgat 
atgtaaaact 
tttaagaaat 
aaatcatgct 
attagatttt 
cataccaatc 
ctatcagcgc 
gaggaatgaa 
gttgaggatg 
cgcgtcgacc 
tgtgttggtt 
ggagggggag 
ggaggccaga 
caacgtcgtg 
cctttcgcca 
cgcagcctga 
atttcacacc 
cagccccgac 
tccgcttaca 
tcatcaccga 
gtcatgataa 
acccctattt 
ccctgataaa 
gtcgccctta 



cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctgactcta 
ttctgcgcta 
ggtgatgcct 
aagcttgagg 
tttgcctttc 
aggaaggtgc 
ctttcatcta 
cacatgagaa 
aagacgaaag 
attaaacctg 
aatatccctg 
aaattaatcg 
aaaaaagatt 
acccataaac 
cgatatacca 
atcgataaag 
ttcaatatag 
atccccacag 
agaggaaatc 
ggaatgttag 
tggtggttaa 
ggtgaaagac 
ccgaaatatt 
gaaataacag 
aaaccctatg 
aaagcggcgg 
tggttctcgc 
tttgaggtga 
gaaattgaga 
tcgagatcca 
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
actgggaaaa 
gctggcgtaa 
atggcgaatg 
gcatatggtg 
acccgccaac 
gacaagctgt 
aacgcgcgag 
taatggtttc 
gtttattttt 
tgcttcaata 
ttcccttttt 



gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gacttaatta 

agattgtcag 

ttgagggtgg 

tgtggcaggc 

tctccacagg 

aaaatcaggg 

ccgacggcag 

caaaagtact 

tctcagttcg 

gagataagga 

caaacttgga 

ctcggaaaac 

cacaaacaat 

gcaccactaa 

attccaaaga 

aatgggcgag 

aaattgattt 

atgcagaaat 

aagttaatga 

cagtttttgg 

gcaaagagaa 

aagcattacc 

tagaaatgct 

ccttaactca 

atttacgtca 

ctgataattt 

tagatatgcg 

ttagggagga 

agttaaagat 

ggcgcggatc 

cttgggggag 

ccttggggga 

cccccgggta. 

ccctggcgtt 

tagcgaagag 

gcgcctgatg 

cactctcagt 

acccgctgac 

gaccgtctcc 

acgaaagggc 

ttagacgtca 

ctaaatacat 

atattgaaaa 

tgcggcattt 



tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
agcgttgggg 
tttccaaaaa 
ccgcgtccat 
ttgagatctg 
tgtccactcc 
tcaagacaaa 
ttttctcggc 
acaagaccta 
agaatctaac 
caccaacggt 
tggactgaag 
ctttgaatgg 
aggtgattta 
aagcgaacat 
tttagcaacg 
atataatgcc 
gtcccagtat 
actatcagga 
agatgtaaaa 
tttacgcccc 
tatagacctc 
cttacataaa 
ggcaacagca 
gcgtattagt 
cgcctgggca 
ggggcatagt 
gaagttagcg 
gaatgctaaa 
ggaaatagct 
aataaaagat 
ggggaggcca 
gggggaggcc 
ccgagctcga 
acccaactta 
gcccgcaccg 
cggtattttc 
acaatctgct 
gcgccctgac 
gggagctgca 
ctcgtgatac 
ggtggcactt 
tcaaatatgt 
aggaagagta 
tgccttcctg 



60 

120 

180 

240- 

300 

360 

420 

480 
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600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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1620 

1680 
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2040 
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2940 
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tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 3360 
gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 3420 
aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 3480 
gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 3540 
ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 3600 
gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 3660 
gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 3720 
atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 3780 
ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 3840 
cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 3900 
cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 3960 
gcggtatcat t-gcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4020 
cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4080 
cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4140 
taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 4200 
ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 4260 
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 4320- 
caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 4380 
taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 4440 
gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 4500 
cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 4560 
taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 4 620 
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 4 680 
ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg eagggtcgga acaggagagc 4740 
gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 4 800 
acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 4860 
acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 4 920 
tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 4 980 
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5040 
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5100 
acgacaggtt tccegactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 5160 
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 5220 
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagctag 5280 
cccgggctag cttgcatgcc tgcaggttt 5309 



<210> 78 
<211> 7608 
<212> DNA 
40 <213> Artificial Seguence 



<220> 

<223> Description of Artificial Sequence: vector 
pPGKnifD 

<400> 78 

tcgaggaatt ctaccgggta ggggaggcgc ttttcccaag gcagtctgga gcatgcgctt 60 
tagcagcccc gctgggcact tggcgctaca caagtggcct ctggctcgca cacattccac 120 
atccaccggt aggcgccaac cggctccgtt ctttggtggc cccttcgcgc caccttctac 180 
tcctccccta gtcaggaagt tcccccccgc cccgcagctc gcgtcgtgca ggacgtgaca 240 
aatggaagta gcacgtctca ctagtctcgt gcagatggac agcaccgctg agcaatggaa 300 
gcgggtaggc ctttggggca gcggccaata gcagctttgc tccttcgctt tctgggctca 360 
gaggctggga aggggtgggt ccgggggcgg gctcaggggc gggctcaggg gcggggcggg 420 
cgcccgaagg tcctccggag gcccggcatt ctgcacgctt caaaagcgca cgtctgccgc 480 
gctgttctcc tcttcctcat ctccgggcct ttcgacctgc agcccggtac agttcgaatg 540 
gctcttccct tccgtcaaat gcactcttgg gattactccg aacctagcga tggggtgcaa 600 
atgtcagatc agataagttc gaataacttc gtatagcata cattatacga agttataagc 660 
ttgcatgcct gcaggtcggc cgccacgacc ggccggccgg tgccgccacc atcccctgac 720 
ccacgcccct gacccctcac aaggagacga ccttccatga ccgagtacaa gcccacggtg 780 
cgcctcgcca cccgcgacga cgtcccccgg gccgtacgca ccctcgccgc cgcgttcgcc 840 
gactaccccg ccacgcgcca caccgtcgac ccggaccgcc acatcgagcg ggtcaccgag 900 
ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg ggtcgcggac 960 
gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg ggcggtgttc 1020 
gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc gcagcaacag 1080 
atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct ggccaccgtc 1140 
ggcgtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct ccccggagtg 1200 
gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc ccgcaacctc 1260 
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ccettctacg 
gcgacctggt 
ccgaccgaaa 
cgaccccgca 
atttgtagag 
taaaatgaat 
aagcaatagc 
tttgtccaaa 
accacaaaaa 
cgtatagcat 
tggccgtcgt 
ttgcagcaca 
cttcccaaca 
aagcggtgcc 
cctcaaactg 
ttacggtcaa 
ttaatgttga 
actcggcgtt 
tgccgtctga 
tggtgctgcg 
gcattttccg 
ttgccactcg 
gcggcgagtt 
tcgccagcgg 
atcgcgtcac 
atctctatcg 
cctgcgatgt 
agccgttgct 
tggatgagca 
tgcgctgttc 
tgtatgtggt 
ccgatgatcc 
atcgtaatca 
atcacgacgc 
aaggcggcgg 
atgaagacca 
ctggagagac 
gcggtttcgc 
tctgggactg 
cttacggcgg 
tctttgccga 
tccagttccg 
gcgataacga 
aagtgcctct 
agccggagag 
catggtcaga 
gtgtgacgct 
tttgcatcga 
agatgtggat 
caccgctgga 
tcgaacgctg 
cagatacact 
aaaccttatt 
ccgttgatgt 
agctggcgca 
ccgaccgcct 
ccccgtacgt 
gcccacacca 
tgatggaaac 
acggtttcca 
tccagctgag 
cgggcagggg 
ctacctacag 
aaactactga 
gagcagtggt 
cacaactaga 
atttgtaacc 



agcggctcgg 
gcatgacccg 
ggagcgcacg 
cccgcccccg 
gttttacttg 
gcaattgttg 
atcacaaatt 
ctcatcaatg 
ggattactcc 
acattatacg 
tttacaacgt 
tccccctttc 
gttgcgcagc 
ggaaagctgg 
gcagatgcac 
tccgccgttt 
tgaaagctgg 
tcatctgtgg 
atttgacctg 
ttggagtgac 
tgacgtctcg 
ctttaatgat 
gcgtgactac 
caccgcgcct 
actacgtctg 
tgcggtggtt 
cggtttccgc 
gattcgaggc 
gacgatggtg 
gcattatccg 
ggatgaagcc 
gcgctggcta 
cccgagtgtg 
gctgtatcgc 
agccgacacc 
gcccttcccg 
gcgcccgctg 
taaatactgg 
ggtggatcag 
tgattttggc 
ccgcacgccg 
tttatccggg 
gctcctgcac 
ggatgtcgct 
cgccgggcaa 
agccgggcac 
ccccgccgcg 
gctgggtaat 
tggcgataaa 
taacgacatt 
gaaggcggcg 
tgctgatgcg 
tatcagccgg 
tgaagtggcg 
ggtagcagag 
tactgccgcc 
cttcccgagc 
gtggcgcggc 
cagccatcgc 
tatggggatt 
cgccggtcgc 
ggatctttgt 
agatttaaag 
ttctaattgt 
ggaatgccag 
atgcagtgaa 
attataagct 



cttcaccgtc 
caagcccggt 
accccatggc 
aggcccaccg 
ctttaaaaaa 
ttgttaactt 
tcacaaataa 
tatcttatca 
ggcccttatc 
aagttatacc 
cgtgactggg 
gccagctggc 
ctgaatggcg 
ctggagtgcg 
ggttacgatg 
gttcccacgg 
ctacaggaag 
tgcaacgggc 
agcgcatttt 
ggcagttatc 
ttgctgcata 
gatttcagcc 
ctacgggtaa 
ttcggcggtg 
aacgtcgaaa 
gaactgcaca 
gaggtgcgga 
gttaaccgtc 
caggatatcc 
aaccatccgc 
aatattgaaa 
ccggcgatga 
atcatctggt 
tggatcaaat 
acggccaccg 
gctgtgccga 
atcctttgcg 
caggcgtttc 
tcgctgatta 
gatacgccga 
catccagcgc 
caaaccatcg 
tggatggtgg 
ccacaaggta 
ctctggctca 
atcagcgcct 
tcccacgcca 
aagcgttggc 
aaacaactgc 
ggcgtaagtg 
ggccattacc 
gtgctgatta 
aaaacctacc 
agcgatacac 
cgggtaaact 
tgttttgacc 
gaaaacggtc 
gacttccagt 
catctgctgc 
ggtggcgacg 
taccattacc 
gaaggaacct 
ctctaaggta 
ttgtgtattt 
atccagacat 
aaaaatgctt 
gcaataaaca 



accgccgacg 
gcctgacgcc 
tccgaccgaa 
actctagagg 
cctcccacac 
gtttattgca 
agcatttttt 
tgtctggatc 
acggttacga 
gggccaccat 
aaaaccctgg 
gtaatagcga 
aatggcgctt 
atcttcctga 
cgcccatcta 
agaatccgac 
gccagacgcg 
gctgggtcgg 
tacgcgccgg 
tggaagatca 
aaccgactac 
gcgctgtact 
cagtttcttt 
aaattatcga 
acccgaaact 
ccgccgacgg 
ttgaaaatgg 
acgagcatca 
tgctgatgaa 
tgtggtacac 
cccacggcat 
gcgaacgcgt 
cgctggggaa 
ctgtcgatcc 
atattatttg 
aatggtccat 
aatacgccca 
gtcagtatcc 
aatatgatga 
acgatcgcca 
tgacggaagc 
aagtgaccag 
cgctggatgg 
aacagttgat 
cagtacgcgt 
ggcagcagtg 
tcccgcatct 
aatttaaccg 
tgacgccgct 
aagcgacccg 
aggccgaagc 
cgaccgctca 
ggattgatgg 
cgcatccggc 
ggctcggatt 
gctgggatct 
tgcgctgcgg 
tcaacatcag 
acgcggaaga 
actcctggag 
agttggtctg 
tacttctgtg 
aatataaaat 
tagattccaa 
gataagatac 
tatttgtgaa 
agttaacaac 



tcgagtgccc 
cgccccacga 
gccgacccgg 
atcataatca 
ctccccctga 
gcttataatg 
tcactgcatt 
cagctgttga 
cggatttgga 
ggtcgcgagt 
cgttacccaa 
agaggcccgc 
tgcctggttt 
ggccgatact 
caccaacgta 
gggttgttac 
aattattttt 
ttacggccag 
agaaaaccgc 
ggatatgtgg 
acaaatcagc 
ggaggctgaa 
atggcagggt 
tgagcgtggt 
gtggagcgcc 
cacgctgatt 
tctgctgctg 
tcctctgcat 
gcagaacaac 
gctgtgcgac 
ggtgccaatg 
aacgcgaatg 
tgaatcaggc 
ttcccgcccg 
cccgatgtac 
caaaaaatgg 
cgcgatgggt 
ccgtttacag 
aaacggcaac 
gttctgtatg 
aaaacaccag 
cgaatacctg 
taagccgctg 
tgaactgcct 
agtgcaaccg 
gcgtctggcg 
gaccaccagc 
ccagtcaggc 
gcgcgatcag 
cattgaccct 
agcgttgttg 
cgcgtggcag 
tagtggtcaa 
gcggattggc 
agggccgcaa 
gccattgtca 
gacgcgcgaa 
ccgctacagt 
aggcacatgg 
cccgtcagta 
gtgtcaaaaa 
gtgtgacata 
ttttaagtgt 
cctatggaac 
attgatgagt 
atttgtgatg 
aacaattgca 



gaaggaccgc 
cccgcagcgc 
gcggccccgc 
gccataccac 
acctgaaaca 
gttacaaata 
ctagttgtgg 
aagctattaa 
tccataactt 
agcttggcac 
cttaatcgcc 
accgatcgcc 
ccggcaccag 
gtcgtcgtcc 
acctatccca 
tcgctcacat 
gatggcgtta 
gacagtcgtt 
ctcgcggtga 
cggatgagcg 
gatttccatg 
gttcagatgt 
gaaacgcagg 
ggttatgccg 
gaaatcccga 
gaagcagaag 
ctgaacggca 
ggtcaggtca 
tttaacgccg 
cgctacggcc 
aatcgtctga 
gtgcagcgcg 
cacggcgcta 
gtgcagtatg 
gcgcgcgtgg 
ctttcgctac 
aacagtcttg 
ggcggcttcg 
ccgtggtcgg 
aacggtctgg 
cagcagtttt 
ttccgtcata 
gcaagcggtg 
gaactaccgc 
aacgcgaccg 
gaaaacctca 
gaaatggatt 
tttctttcac 
ttcacccgtg 
aacgcctggg 
cagtgcacgg 
catcagggga 
atggcgatta 
ctgaactgcc 
gaaaactatc 
gacatgtata 
ttgaattatg 
caacagcaac 
ctgaatatcg 
tcggcggaat 
taataataac 
attggacaaa 
ataatgtgtt 
tgatgaatgg 
ttggacaaac 
ctattgcttt 
ttcattttat 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100, 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 



112 



10 



15 



',0 



15 



30 



35 



40 



gtttcaggtt 
tggtatggct 
taatgtcatg 
cggaacccct 
ataaccctga 
ccgtgtcgcc 
aacgctggtg 
actggatctc 
gatgagcact 
agagcaactc 
cacagaaaag 
catgagtgat 
aaccgctttt 
gctgaatgaa 
aacgttgcgc 
agactggatg 
ctggtttatt 
actggggcca 
aactatggat 
gtaactgtca 
atttaaaagg 
tgagttttcg 
tccttttttt 
ggtttgtttg 
agcgcagata 
ctctgtagca 
tggcgataag 
gcggtcgggc 
cgaactgaga 
ggcggacagg 
agggggaaac 
tcgatttttg 
ctttttacgg 
ccctgattct 
ccgaacgacc 
accgcctctc 
ctggaaagcg 
ccaggcttta 
atttcacaca 



cagggggagg 

gattatgatc 

ataataatgg 

atttgtttat 

taaatgcttc 

cttattccct 

aaagtaaaag 

aacagcggta 

tttaaagttc 

ggtcgccgca 

catcttacgg 

aacactgcgg 

ttgcacaaca 

gccataccaa 

aaactattaa 

gaggcggata 

gctgataaat 

gatggtaagc 

gaacgaaata 

gaccaagttt 

atctaggtga 

ttccactgag 

ctgcgcgtaa 

ceggatcaag 

ccaaatactg 

ccgcctacat 

tcgtgtctta 

tgaacggggg 

tacctacagc 

tatccggtaa 

gcctggtatc 

tgatgctcgt 

ttcctggcct 

gtggataacc 

gagcgcagcg 

cccgcgcgtt 

ggcagtgagc 

cactttatgc 

ggaaacagct 



tgtgggaggt 
tgcggccgca 
tttcttagac 
ttttctaaat 
aataatattg 
tttttgcggc 
atgctgaaga 
agatecttga 
tgctatgtgg 
tacactattc 
atggcatgac 
ccaacttact 
tgggggatca 
acgacgagcg 
ctggcgaact 
aagttgcagg 
ctggagccgg 
cctcccgtat 
gacagatcgc 
actcatatat 
agatcctttt 
cgtcagaccc 
tctgctgctt 
agctaccaac 
tccttctagt 
acctcgctct 
ccgggttgga 
gttcgtgcac 
gtgagctatg 
gcggcagggt 
tttatagtcc 
caggggggcg 
tttgctggcc 
gtattaccgc 
agtcagtgag 
ggccgattca 
gcaacgcaat 
ttccggctcg 
atgaccatga 



tttttaaagc 

gggcctcgtg 

gtcaggtggc 

acattcaaat 

aaaaaggaag 

attttgcctt 

tcagttgggt 

gagttttcgc 

cgcggtatta 

tcagaatgac 

agtaagagaa 

tctgacaacg 

tgtaactcgc 

tgacaccacg 

acttactcta 

accacttctg 

tgagcgtggg 

cgtagttatc 

tgagataggt 

actttagatt 

tgataatctc 

cgtagaaaag 

gcaaacaaaa 

tctttttccg 

gtagccgtag 

gctaatcctg 

ctcaagacga 

acagcccagc 

agaaagcgcc 

cggaacagga 

tgtcgggttt 

gagcctatgg 

ttttgetcac 

ctttgagtga 

cgaggaagcg 

ttaatgcagc 

taatgtgagt 

tatgttgtgt 

ttacgccaag 



aagtaaaacc 

atacgcctat 

acttttcggg 

atgtatccgc 

agtatgagta 

cctgtttttg 

gcacgagtgg 

cccgaagaac 

tcccgtattg 

ttggttgagt 

ttatgcagtg 

atcggaggac 

cttgatcgtt 

atgcctgtag 

gcttcccggc 

cgctcggccc 

tctcgcggta 

tacacgacgg 

gcctcactga 

gatttaaaac 

atgaccaaaa 

atcaaaggat 

aaaccaccgc 

aaggtaactg 

ttaggccacc 

ttaccagtgg 

tagttaccgg 

ttggagcgaa 

acgcttcccg 

gagcgcacga 

cgccacctct 

aaaaacgcca 

atgttctttc 

gctgataccg 

gaagagcgcc 

tggcacgaca 

tagctcactc 

ggaattgtga 

ctggcgcg 



tctacaaatg 

ttttataggt 

gaaatgtgcg 

tcatgagaca 

ttcaacattt 

ctcacccaga 

gttacatcga 

gttttccaat 

acgccgggca 

actcaccagt 

etgccataac 

cgaaggagct 

gggaaccgga 

caatggcaac 

aacaattaat 

ttccggctgg 

tcattgcagc 

ggagtcaggc 

ttaagcattg 

ttqattttta 

tcccttaacg 

cttcttgaga 

taccagcggt 

gcttcagcag 

acttcaagaa 

ctgctgccag 

ataaggcgca 

cgacctacac 

aagggagaaa 

gggagcttcc 

gacttgagcg 

gcaacgcggc 

ctgcgttatc 

ctcgccgcag 

caatacgcaa 

ggtttcccga 

attaggcacc 

gcggataaca 



5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
624 0 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7608 
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50 



55 



60 



65 



<210> 79 
<211> 7523 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
pPGKnifD3' 

<400> 79 

tcgaggaatt ctaccgggta ggggaggcgc 
tagcagcccc gctgggcact tggcgctaca 
atccaccggt aggcgccaac cggctccgtt 
tcctccccta gtcaggaagt tcccccccgc 
aatggaagta gcacgtctca ctagtctcgt 
gcgggtaggc ctttggggca gcggccaata 
gaggctggga aggggtgggt ccgggggcgg 
cgcccgaagg tcctccggag gcccggcatt 
gctgttctcc tcttcctcat ctccgggcct 
acttcgtata gcatacatta tacgaagtta 
cgaccggccg gccggtgccg ccaccatccc 
gacgaccttc eatgaccgag tacaagccca 
cccgggccgt acgcaccctc gccgccgcgt 
tcgacccgga ccgccacatc gagcgggtca 
tcgggctcga catcggcaag gtgtgggtcg 
ccacgccgga gagcgtegaa gcgggggcgg 



Sequence: vector 



ttttcccaag 
caagtggcct 
ctttggtggc 
cccgcagctc 
gcagatggac 
gcagctttgc 
gctcaggggc 
ctgcacgctt 
ttcgacctgc 
taagcttgca 
ctgacccacg 
cggtgcgcct 
tcgccgacta 
ccgagctgca 
cggacgacgg 
tgttcgccga 



gcagtctgga 
ctggctcgca 
cccttcgcgc 
gcgtcgtgca 
agcaccgctg 
tccttcgctt 
gggctcaggg 
caaaagcgca 
agcccggtac 
tgcctgcagg 
cccctgaccc 
cgccacccgc 
ccccgccacg 
agaactcttc 
cgccgcggtg 
gatcggcccg 



gcatgcgctt 60 
cacattccac 120 
caccttctac 180 
ggacgtgaca 240 
agcaatggaa 300 
tctgggctca 360 
gcggggcggg 420 
cgtctgccgc 480 
agttcgaata 540 
tcggccgcca 600 
ctcacaagga 660 
gacgacgtcc 720 
cgccacaccg 780 
ctcaegcgcg 840 
gcggtctgga 900 
cgcatggccg 960 




agttgagcgg ttcccggctg gccgcgcagc aacagatgga aggcctcctg gcgccgcacc 1020 

ggcccaagga gcccgcgtgg ttcctggcca ccgtcggcgt ctcgcccgac caccagggca 1080 

agggtctggg cagcgccgtc gtgctccccg gagtggaggc ggccgagcgc gccggggtgc 1140 

ccgccttcct ggagacctcc gcgccccgca acctcccctt ctacgagcgg ctcggcttca 1200 

5 ccgtcaccgc cgacgtcgag tgcccgaagg accgcgcgac ctggtgcatg acccgcaagc 1260 

ccggtgcctg acgcccgccc cacgacccgc agcgcccgac cgaaaggagc gcacgacccc 1320 

atggetccga ccgaagccga cccgggcggc cccgccgacc ccgcacccgc ccccgaggcc 1380 

caccgactct agaggatcat aatcagccat accacatttg tagaggtttt acttgcttta 1440 

aaaaacctcc cacacctccc cctgaacctg aaacataaaa tgaatgcaat tgttgttgtt 1500 

10 aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 1560 

aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 1620 

tatcatgtct ggatccagct gttgaaagct attaaaccac aaaaaggatt actccggccc 1680 

ttatcacggt tacgacggat ttggatccat aacttcgtat agcatacatt atacgaagtt 1740 

ataccgggcc accatggtcg cgagtagctt ggcactggcc gtcgttttac aacgtegtga 1800 

15 ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 1860 

ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 1920 

tggcgaatgg cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga 1980 

gtgcgatctt cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta 2040 

cgatgcgccc atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc 2100 

20 cacggagaat ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca 2160 

ggaaggccag acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa 2220 

, cgggcgctgg gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc 2280 

! atttttacgc gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag 2340 

j ttatctggaa gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct 2400 

'25 gcataaaccg actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt 24 60 

j cagccgcgct gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg 2520 

ggtaacagtt tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg 2580 

cggtgaaatt atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt 264 0 

cgaaaacccg aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact 2700 

30 gcacaccgcc gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt 27 60 

gcggattgaa aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa 2820 

ccgtcacgag catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga 2880 

! tatcctgctg atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca 2940 

j tccgctgtgg tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat 3000 ! 

35 tgaaacccac ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc 3060 

j gatgagcgaa cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat 3120 

j ctggtcgctg gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat 3180 

: caaatctgtc gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc 3240 

caccgatatt atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt 3300 

.40 gccgaaatgg tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct 3360 

ttgcgaatac gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc 3420 

gtttcgtcag tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct 3480 

gattaaatat gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac 3540. 

gccgaacgat cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc 3600 

45 agcgctgacg gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac 3660 

catcgaagtg accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat 3720 

ggtggcgctg gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca 3780 

aggtaaacag ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg 3840 

gctcacagta cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag 3900 

50 cgcctggcag cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca 3960 

cgccatcccg catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg 4020 

ttggcaattt aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca 4080 

actgctgacg ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt 4140 

aagtgaagcg acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca 4200 

55 ttaccaggcc gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct 4260 

gattacgacc gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac 4320 

ctaccggatt gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga 4380 

tacaccgcat ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt 4440 

aaactggctc ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt 4500 

60 tgaccgctgg gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa 4560 

cggtctgcgc tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt 4620 

ccagttcaac atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct 4680 

gctgcacgcg gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg 4740 

cgacgactcc tggagcccgt cagtatcggc ggaattccag ctgagcgccg gtcgctacca 4800 

65. ttaccagttg gtctggtgtc aaaaataata ataaccgggc aggggggatc tttgtgaagg 4860 

aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt taaagctcta 4920 

aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta attgtttgtg 4980 
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tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat gccagatcca 5040 
gacatgataa gatacattga tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa 5100 
tgctttattt gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat 5160 
aaacaagtta acaacaacaa ttgcattcat tttatgtttc aggttcaggg ggaggtgtgg 5220 
5 gaggtttttt aaagcaagta aaacctctac aaatgtggta tggctgatta tgatctgcgg 5280 
ccgcagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 5340 
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 5400 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 54 60 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 5520 

10 gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 5580 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 5640 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 5700 
tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac 5760 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 5820 

15 atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 5880 
ttacttctga caaegatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 5940 
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 6000 • 
gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc 6060 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 6120 

20 gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 6180 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 6240 

| cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 6300 

! atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 6360 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 6420 

25 ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 6480 ' 
gaccccgtag aaaagatcaa aggatettct tgagatcctt tttttctgcg cgtaatctgc 6540 
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 6600 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 6660 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 6720 

30 gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 6780 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 6840 

; tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 6900 

ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 6960 , 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 7020 

•35 agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 7080 

[ gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 7140 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 7200 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 7260 
gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg 7320 

40 attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag tgagcgcaac 7380 
gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg 7440 
gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa eagctatgac 7500 :.. 
catgattacg ccaagctggc gcg 7523 . 

45 

<210> 80 
<211> 4754 
<212> DNA 

<213> Artificial Sequence 

50 

<220> ■■■ ■ . 

<223> Description of Artificial Seguence: vector 
pRK55pbsC31-Re 

55 <400> 80 

ggccgcccga tatgacacaa ggggttgtga ccggggtgga cacgtacgcg ggtgcttacg 60 
accgtcagtc gcgcgagcgc gagaattcga gcgcagcaag cccagcgaca cagcgtagcg 120 ,, 
ccaacgaaga caaggcggcc gaccttcagc gcgaagtcga gcgcgacggg ggccggttca 180 
ggttcgtcgg gcatttcagc gaagcgccgg gcacgtcggc gttcgggacg gcggagcgcc 24 0 

60 cggagttcga acgcatcctg aacgaatgcc gcgccgggcg gctcaacatg atcattgtct 300 
atgacgtgtc gcgcttctcg cgcctgaagg tcatggacgc gattccgatt gtctcggaat 360 
tgctcgccct gggcgtgacg attgtttcca ctcaggaagg cgtcttccgg cagggaaacg 420 
tcatggacct gattcacctg attatgcggc tcgacgcgtc gcacaaagaa tcttcgctga 480 
agtcggcgaa gattctcgac acgaagaacc ttcagcgcga attgggcggg tacgtcggcg 540 

65 . ggaaggcgcc ttacggcttc gagcttgttt cggagacgaa ggagatcacg cgcaacggcc 600 
gaatggtcaa tgtcgtcatc aacaagcttg cgcactcgac cactcccctt accggaccct 660 
tcgagttcga gcccgacgta atccggtggt ggtggcgtga gatcaagacg cacaaacacc 720 



115 



\ n rim nan q 



10 



15 



10 



15 



50 



55 



40 



45 



50 



55 



60 



65 



ttcccttcaa 
agcgcatgga 
caagcgcctg 
tcgccgctga 
gttaccgcat 
tcatcgagcc 
aggggctttc 
gcgccgtcat 
gccggaaggt 
tggcggcact 
gcgacgaaga 
aggcgcctga 
acgcccttga 
ggaagcactt 
ggcttgccga 
aagacgccga 
acaagcgcgt 
gcagggggca 
ccgacgacga 
ccgggctcga 
agcttggcgt 
ccacacaaca 
taactcacat 
cagctgcatt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 
aagctgggct 
tatcgtcttg 
aacaggatta 
aactacggct 
ttcggaaaaa 
ttttttgttt 
atcttttcta 
atgagattat 
tcaatctaaa 
gcacctatct 
tagataacta 
gaeccacgct 
cgcagaagtg 
gctagagtaa 
atcgtggtgt 
aggcgagtta 
atcgttgtca 
aattctctta 
aagtcattct 
gataataccg 
gggcgaaaac 
gcacccaact 
ggaaggcaaa 
ctcttccttt 
atatttgaat 
gtgccaccta 
tcagctcatt 
agaccgagat 
tggactccaa 
catcacccta 
aagggagccc 
ggaagaaagc 
taaccaccac 
ggctgcgcaa 
cgaaaggggg 
gacgttgtaa 



gccgggcagt 
cgctgacgcc 
ggacccggca 
ggtgatctac 
tcagcgcgac 
cgctgagtgg 
ccgggggcaa 
gacttcgaag 
ggtcgacccg 
cgacaagttc 
gacgttggcg 
gaagagcggc 
agagctgtac 
ccggaagcaa 
acttgaagcc 
cgctgacccg 
gttcgtcggg 
gggaacgccc 
cgaagacgac 
gggggggccc 
aatcatggtc 
tacgagccgg 
taattgcgtt 
aatgaatcgg 
cgctcactga 
aggcggtaat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcatagctc 
gtgtgcacga 
agtccaaccc 
gcagagcgag 
acactagaag 
gagttggtag 
gcaagcagca 

cggggtctga 

caaaaaggat 
gtatatatga 
cagcgatctg 
cgatacggga 
caccggctcc 
gtcctgcaac 
gtagttcgcc 
cacgctcgtc 
catgatcccc 
gaagtaagtt 
ctgtcatgcc 
gagaatagtg 
cgccacatag 
tctcaaggat 
gatcttcagc 
atgccgcaaa 
ttcaatatta 
gtatttagaa 
aattgtaagc 
ttttaaccaa 
agggttgagt 
cgtcaaaggg 
atcaagtttt 
ccgatttaga 
gaaaggagcg 
acccgccgcg 
ctgttgggaa 
atgtgctgca 
aacgacggcc 



caagccgcca 
gtgccgaccc 
accgttatgc 
aagaagaagc 
ccgatcacgc 
tatgagcttc 
gccattctgt 
cgcggggaag 
tccgcacctg 
gttgcggaac 
cttctgtggg 
gaacgggcga 
gaagaccgcg 
caggcagcgc 
gccgaagccc 
accggcccta 
ctcttcgtag 
atcgagaagc 
gcccaggacg 
ggtacccagc 
atagctgttt 
aagcataaag 
gcgctcactg 
ccaacgcgcg 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 
accccccgtt 
ggtaagacac 
gtatgtaggc 
gacagtat'tt 
ctcttgatcc 
gattacgcgc 
cgctcagtgg 
cttcacctag 
gtaaacttgg 
tctatttcgt 
gggcttacca 
agatttatca 
tttatccgcc 
agttaatagt 
gtttggtatg 
catgttgtgc 
ggccgcagtg 
atccgtaaga 
tatgcggcga 
cagaacttta 
cttaccgctg 
atcttttact 
aaagggaata 
ttgaagcatt 
aaataaacaa 
gttaatattt 
taggccgaaa 
gttgttccag 
cgaaaaaccg 
ttggggtcga 
gcttgacggg 
ggcgctaggg 
cttaatgcgc 
gggcgatcgg 
aggcgattaa 
agtgaattgt 



ttcacccggg 
ggggcgagac 
gaatccttcg 
cggacggcac 
tccggccggt 
aggcgtggtt 
ccgccatgga 
aatcgatcaa 
ggcagcacga 
gcatcttcaa 
aagccgcccg 
accttgttgc 
cggcaggcgc 
tgacgctccg 
cgaagcttcc 
agtcgtggtg 
acaagatcgt 
gcgcttcgat 
gcacggaaga 
ttttgttccc 
cctgtgtgaa 
tgtaaagcct 
cccgctttcc 
gggagaggcg 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
tatctcagtt 
cagcccgacc 
gacttatcgc 
ggtgctacag 
ggtatctgcg 
ggcaaacaaa 
agaaaaaaag 
aacgaaaact 
atccttttaa 
tctgacagtt 
tcatccatag 
tctggcccca 
gcaataaacc 
tccatccagt 
ttgcgcaacg 
gcttcattca 
aaaaaagcgg 
ttatcactca 
tgcttttctg 
ccgagttgct 
aaagtgctca 
ttgagatcca 
ttcaccagcg 
agggcgacac 
tatcagggtt 
ataggggttc 
tgttaaaatt 
tcggcaaaat 
tttggaacaa 
tctatcaggg 
ggtgccgtaa 
gaaagccggc 
cgctggcaag 
cgctacaggg 
tgcgggectc 
gttgggtaac 
aatacgactc 



cagcatcacg 
gattgggaag 
ggacccgcgt 
gccgaccacg 
cgagcttgat 
ggacggcagg 
caagctgtac 
ggactcttac 
aggcacgtgc 
caagatcagg 
acgcttcggc 
ggagcgcgcc 
gtacgacgga 
gcagcaaggg 
ccttgaccaa 
ggggcgcgcg 
tgtcacgaag 
cacgtgggcg 
cgtagcggcg 
tttagtgagg 
attgttatcc 
ggggtgccta 
agtcgggaaa 
gtttgcgtat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 
gctgcgcctt 
cactggcagc 
agttcttgaa 
ctctgctgaa 
ccaccgctgg 
gatctcaaga 
cacgttaagg 
attaaaaatg 
accaatgctt 
ttgcctgact 
gtgctgcaat 
agccagccgg 
ctattaattg 
ttgttgccat 
gctccggttc 
ttagctcctt 
tggttatggc 
tgactggtga 
cttgcccggc 
tcattggaaa 
gttcgatgta 
tttctgggtg 
ggaaatgttg 
attgtctcat 
cgcgcacatt 
cgcgttaaat 
cccttataaa 
gagtccacta 
cgatggccca 
agcactaaat 
gaacgtggcg 
tgtagcggtc 
cgcgtcccat 
ttcgctatta 
gccagggttt 
actatagggc 



gggctttgta 
aagaecgctt 
attgcgggct 
aagattgagg 
tgcggaccga 
gggcgcggca 
tgcgagtgtg 
cgctgccgtc 
aacgtcagca 
cacgccgaag 
aagctcactg 
gacgccctga 
cccgttggca 
gcggaagagc 
tggttceccg 
tcagtagacg 
tcgactacgg 
aagccgccga 
taggcggcgc 
gttaatttcg 
gctcacaatt 
atgagtgagc 
cctgtcgtgc 
tgggcgetct 
agcggtatca 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 
cgttcgctcc 
atccggtaac 
agccactggt 
gtggtggcct 
gccagttacc 
tagcggtggt 
agatcctttg 
gattttggtc 
aagttttaaa 
aatcagtgag 
ccccgtcgtg 
gataccgcga 
aagggccgag 
ttgccgggaa 
tgctacaggc 
ccaacgatca 
cggtcctccg 
agcactgcat 
gtactcaacc 
gtcaatacgg 
acgttcttcg 
acccactcgt 
agcaaaaaca 
aatactcata 
gagcggatac 
tccccgaaaa 
ttttgttaaa 
tcaaaagaat 
ttaaagaacg 
ctacgtgaac 
cggaacccta 
agaaaggaag 
acgctgcgcg 
tcgceattca 
cgccagctgg 
tcccagtcac 
gaattggagc 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
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<210> 81 
<211> 4773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
pRK63pbsC31NLS-Re 



vector 



15 



55 



40 
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50 



55 



60 
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<400> 81 

ggccgcacca 

acgtacgcgg 

ccagcgacac 

cgcgacgggg 

ttcgggacgg 

ctcaacatga 

attccgattg 

gtcttccggc 

cacaaagaat 

ttgggcgggt 

gagatcacgc 

actcccctta 

atcaagacgc 

agcatcacgg 

attgggaaga 

gacccgcgta 

ccgaccacga 

gagcttgatt 

gacggcaggg 

aagctgtact 

gactcttacc 

ggcacgtgca 

aagatcaggc 

cgcttcggca 

gagcgcgccg 

tacgacggac 

cagcaagggg 

cttgaccaat 

gggcgcgcgt 

gtcacgaagt 

acgtgggcga 

gtagcggcgt 

ttagtgaggg 

ttgttatccg 

gggtgcctaa 

gtcgggaaac 

tttgcgtatt 

gctgcggcga 

ggataacgca 

ggccgcgttg 

acgctcaagt 

tggaagctcc 

ctttctccct 

ggtgtaggtc 

ctgcgcctta 

actggcagca 

gttcttgaag 

tctgctgaag 

caccgctggt 

atctcaagaa 

acgttaaggg 

ttaaaaatga 

ccaatgctta 

tgcctgactc 



tgcccaagaa 
gtgcttacga 
agcgtagcgc 
gccggttcag 
cggagcgccc 
tcattgtcta 
tctcggaatt 
agggaaacgt 
cttcgctgaa 
acgtcggcgg 
gcaacggccg 
ccggaccctt 
acaaacacct 
ggctttgtaa 
agaccgcttc 
ttgcgggctt 
agattgaggg 
gcggaccgat 
ggcgcggcaa 
gcgagtgtgg 
gctgccgtcg 
acgtcagcat 
acgccgaagg 
agctcactga 
acgccctgaa 
ccgttggcag 
cggaagagcg 
ggttccccga 
cagtagacga 
cgactacggg 
agccgccgac 
aggcggcgcc 
ttaatttcga 
ctcacaattc 
tgagtgagct 
ctgtcgtgcc 
gggcgctctt 
gcggtatcag 
ggaaagaaca 
ctggcgtttt 
cagaggtggc 
ctcgtgcgct 
tcgggaagcg 
gttcgctcca 
tccggtaact 
gccactggta 
tggtggccta 
ccagttacct 
agcggtggtt 
gatcctttga 
attttggtca 
agttttaaat 
atcagtgagg 
cccgtcgtgt 



gaagaggaag 
ccgtcagtcg 
caacgaagac 
gttcgtcggg 
ggagttcgaa 
tgacgtgtcg 
gctcgccctg 
catggacctg 
gtcggcgaag 
gaaggcgcct 
aatggtcaat 
cgagttcgag 
tcccttcaag 
gcgcatggac 
aagcgcctgg 
cgccgctgag 
ttaccgcatt 
catcgagccc 
ggggctttcc 
cgccgtcatg 
ccggaaggtg 
ggcggcactc 
cgacgaagag 
ggcgcctgag 
cgcccttgaa 
gaagcacttc 
gcttgccgaa 
agacgccgac 
caagcgcgtg 
cagggggcag 
cgacgacgac 
cgggctcgag 
gcttggcgta 
cacacaacat 
aactcacatt 
agctgcatta 
ccgcttcctc 
ctcactcaaa 
tgtgagcaaa 
tccataggct 
gaaacccgac 
ctcctgttcc 
tggcgctttc 
agctgggctg 
atcgtcttga 
acaggattag 
actacggcta 
tcggaaaaag 
tttttgtttg 
tcttttctac 
tgagattatc 
caatctaaag 
cacctatctc 
agataactac 



gtgacacaag 
cgcgagcgcg 
aaggcggccg 
catttcagcg 
cgcatcctga 
cgcttctcgc 
ggcgtgacga 
attcacctga 
attctcgaca 
tacggcttcg 
gtcgtcatca 
cccgacgtaa 
ccgggcagtc 
gctgacgccg 
gacccggcaa 
gtgatctaca 
cagcgcgacc 
gctgagtggt 
cgggggcaag 
acttcgaagc 
gtcgacccgt 
gacaagttcg 
acgttggcgc 
aagagcggcg 
gagctgtacg 
cggaagcaac 
cttgaagccg 
gctgacccga 
ttcgtcgggc 
ggaacgccca 
gaagacgacg 
ggggggcccg 
atcatggtca 
acgagccgga 
aattgcgttg 
atgaatcggc 
gctcactgac 
ggcggtaata 
aggccagcaa 
ccgcccccct 
aggactataa 
gaccctgccg 
tcatagctca 
tgtgcacgaa 
gtccaacccg 
cagagcgagg 
cactagaagg 
agttggtagc 
caagcagcag 
ggggtctgac 
aaaaaggatc 
tatatatgag 
agcgatctgt 
gatacgggag 



gggttgtgac 
agaattcgag 
accttcagcg 
aagcgccggg 
acgaatgccg 
gcctgaaggt 
ttgtttccac 
ttatgcggct 
cgaagaacct 
agcttgtttc 
acaagcttgc 
tccggtggtg 
aagccgccat 
tgccgacccg 
ccgttatgcg 
agaagaagcc 
cgatcacgct 
atgagcttca 
ccattctgtc 
gcggggaaga 
ccgcacctgg 
ttgcggaacg 
ttctgtggga 
aacgggcgaa 
aagaccgcgc 
aggcagcgct 
ccgaagcccc 
ccggccctaa 
tcttcgtaga 
tcgagaagcg 
cccaggacgg 
gtacccagct 
tagctgtttc 
agcataaagt 
cgctcactgc 
caacgcgcgg 
tcgctgcgct 
cggttatcca 
aaggccagga 
gacgagcatc 
agataccagg 
cttaccggat 
cgctgtaggt 
ccccccgttc 
gtaagacacg 
tatgtaggcg 
acagtatttg 
tcttgatccg 
attacgcgca 
gctcagtgga 
ttcacctaga 
taaacttggt 
ctatttcgtt 
ggcttaccat 



cggggtggac 
cgcagcaagc 
cgaagtcgag 
cacgtcggcg 
cgccgggcgg 
catggacgcg 
tcaggaaggc 
cgacgcgtcg 
tcagcgcgaa 
ggagacgaag 
gcactcgacc 
gtggcgtgag 
tcacccgggc 
gggcgagacg 
aatccttcgg 
ggacggcacg 
ccggccggtc 
ggcgtggttg 
cgccatggac 
atcgatcaag 
gcagcacgaa 
catcttcaac 
agccgcccga 
ccttgttgcg 
ggcaggcgcg 
gacgctccgg 
gaagcttccc 
gtcgtggtgg 
caagatcgtt 
cgcttcgatc 
cacggaagac 
tttgttccct 
ctgtgtgaaa 
gtaaagcctg 
ccgctttcca 
ggagaggcgg 
cggtcgttcg 
cagaatcagg 
accgtaaaaa 
acaaaaatcg 
cgtttccccc 
acctgtccgc 
atctcagttc 
agcccgaccg 
acttatcgcc 
gtgctacaga 
gtatctgcgc 
gcaaacaaac 
gaaaaaaagg 
acgaaaactc 
tccttttaaa 
ctgacagtta 
catccatagt 
ctggccccag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
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tgctgcaatg ataccgcgag acccacgctc 
gccagccgga agggccgagc gcagaagtgg 
tattaattgt tgccgggaag ctagagtaag 
tgttgccatt gctacaggca tcgtggtgtc 
ctccggttcc caacgatcaa ggcgagttac 
tagctccttc ggtcctccga tcgttgtcag 
ggttatggca gcactgcata attctcttac 
gactggtgag tactcaacca agtcattctg 
ttgcccggcg tcaatacggg ataataccgc 
cattggaaaa cgttcttcgg ggcgaaaact 
ttcgatgtaa cccactcgtg cacccaactg 
ttctgggtga gcaaaaacag gaaggcaaaa 
gaaatgttga atactcatac tcttcctttt 
ttgtctcatg agcggataca tatttgaatg 
gcgcacattt ccccgaaaag tgccacctaa 
gcgttaaatt tttgttaaat cagctcattt 
ccttataaat caaaagaata gaccgagata 
agtccactat taaagaacgt ggactccaac 
gatggcccac tacgtgaacc atcaccctaa 
gcactaaatc ggaaccctaa agggagcccc 
aacgtggcga gaaaggaagg gaagaaagcg 
gtagcggtca cgctgcgcgt aaccaccaca 
gcgtcccatt cgccattcag gctgcgcaac 
tcgctattac gccagctggc gaaaggggga 
ccagggtttt cccagtcacg acgttgtaaa 
ctatagggcg aattggagct ccaccgcggt 



accggctcca gatttatcag caataaacca 3300 
tcctgeaact ttatccgcct ccatccagtc 3360 
tagttcgcca gttaatagtt tgcgcaacgt 3420 
acgctcgtcg tttggtatgg cttcattcag 3480 
atgatccccc atgttgtgca aaaaagcggt 3540 
aagtaagttg gccgcagtgt tateactcat 3600 
tgtcatgcca tccgtaagat gcttttctgt 3660 
agaatagtgt atgcggcgac cgagttgctc 3720 
gccacatagc agaactttaa aagtgctcat 3780 
ctcaaggatc ttaccgctgt tgagatccag 3840 
atcttcagca tcttttactt tcaccagcgt 3900 
tgccgcaaaa aagggaataa gggcgacacg 3960 
tcaatattat tgaagcattt atcagggtta 4020 
tatttagaaa aataaacaaa taggggttcc 4080 
attgtaagcg ttaatatttt gttaaaattc 4140 
tttaaccaat aggccgaaat cggcaaaatc 4200 
gggttgagtg ttgttccagt ttggaacaag 4260 
gtcaaagggc gaaaaaccgt ctatcagggc 4320 
tcaagttttt tggggtcgag gtgccgtaaa 4380 
cgatttagag cttgacgggg aaagccggcg 4440 
aaaggagcgg gcgctagggc gctggcaagt 4500 
cccgccgcgc ttaatgcgcc gctacagggc 4 560 
tgttgggaag ggcgatcggt gcgggcctct 4 620 
tgtgctgcaa ggcgattaag ttgggtaaeg 4680 
acgacggcca gtgaattgta atacgactca 4740 

4773 



<210> 82 
<211> 7803 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pPGKattAl 

<400> 82 

tatcatgtct ggatccgcgt taacacctaa gaaggcgaag ttttccttac accttgcaga 60 
tataaagtgt ctaacagttt aaaatatccg atgaggcata tttatgttgg acccgtagct 120 
cagccaggat agagcactgg cctccggagc cggaggtccc gggttcaaat cccggcgggt 180 
ccgtatatta ctttttgatt cagattagat ttgtaaatct ttattacaag^gataatttga 240 
tcttgtatat tggtaactct ctactctata atttttatga gaaattcaca gtcgtccctt 300 
tataccataa atagctaagt ttgtcaaagt tcttattaaa ctctccatgt agagattaaa 360 
tcggatccat aacttcgtat agcatacatt atacgaagtt ataccgggcc accatggtcg 420 
cgagtagctt ggcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 480 
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 540 
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgctttgcct 600 
ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt cctgaggccg 660 
atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc atctacacca 720 
acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat ccgacgggtt 780 
gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag acgcgaatta 840 
tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg gtcggttacg 900 
gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc gccggagaaa 960 
accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa gatcaggata 1020 
tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg actacacaaa 1080 
tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct gtactggagg 1140 
ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt tctttatggc 1200 
agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt atcgatgagc 1260 
gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg aaactgtgga 1320 
gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc gacggcacgc 1380 
tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa aatggtctgc 14 4 0 
tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag catcatcctc 1500 
tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg atgaagcaga 1560 
acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg tacacgctgt 1620 
gegaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac ggcatggtgc 1680 
caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa cgcgtaacgc 1740 




gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg gggaatgaat 1800 
caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc gatccttccc 1860 
gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt atttgcccga 1920 
tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg tccatcaaaa 1980 
5 aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac gcccacgcga 2040 
tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag tatccccgtt 2100 
tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat gatgaaaacg 2160 
gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat cgccagttct 2220 
gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg gaagcaaaac 2280 

10 accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg accagcgaat 2340 
acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg gatggtaagc 2400 
cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag ttgattgaac 24 60 
tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta cgcgtagtgc 2520 
aaccgaacgc gaccgcatgg tcagaagccg ggeacatcag cgcctggcag cagtggcgtc 2580 

15 tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg catctgacca 2640 
ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt aaccgccagt 2700 
caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg ccgctgcgcg 27 60 
atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg acccgcattg 2820 
accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc gaagcagcgt 2880 
) tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc gctcacgcgt 2940 
ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt gatggtagtg 3000 
gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat ccggcgcgga 3060 
ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc ggattagggc 3120 
cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg gatctgccat 3180 
5 tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc tgcgggacgc 3240 
gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac atcagccgct 3300 
acagtcaaca geaactgatg gaaaccagcc atcgccatct gctgcacgcg gaagaaggca 3360 
catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc tggagcccgt 3420 
cagtatcggc ggaattccag ctgagcgccg gtcgctacca ttaccagttg gtctggtgtc 3480 
|0 aaaaataata ataaccggge aggggggatc tttgtgaagg aaccttactt ctgtggtgtg 3540 
acataattgg acaaactacc tacagagatt taaagctcta aggtaaatat aaaattttta 3600 
1 agtgtataat gtgttaaact actgattcta attgtttgtg tattttagat tccaacctat 3660 

i ggaactgatg aatgggagca gtggtggaat gccagatcca gacatgataa gatacattga 3720 
tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa tgctttattt ; gtgaaatttg 3780 

$5 tgatgctatt gctttatttg taaccattat aagctgcaat aaacaagtta acaacaacaa 3840 
ttgcattcat tttatgtttc aggttcaggg ggaggtgtgg gaggtttttt aaagcaagta 3900 

I aaacctctac aaatgtggta tggctgatta tgatctgcgg ccgcagggcc tcgtgatacg 3960 

cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt 4020 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 4080 

40 tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 4140 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 4200 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 4260 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 4320 
agaacgtttt ecaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 4380 

45 tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 44 4 0 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 4500 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 4560 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 4620 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 4680 

50 tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta. ctctagcttc 4740 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 4800 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 4860 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 4920 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 4980 

55 actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 5040 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 5100 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 5160 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 5220 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 5280 

60 aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 5340 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 5400 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 54 60 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 5520 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 5580 

65 tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 564 0 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 5700 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 5760 



119 



10 



15 



0 



cgccagcaac 
ctttcctgcg 
taccgctcgc 
gcgcccaata 
cgacaggttt 
cactcattag 
tgtgagcgga 
gcgtcgagga 
ctttagcagc 
cacatccacc 
tactcctccc 
acaaatggaa 
gaagcgggta 
tcagaggctg 
gggcgcccga 
cgcgctgttc 
ataacttcgt 
ccacgaccgg 
ggagacgacc 
tcccccgggc 
ccgtcgaccc 
gcgtcgggct 
ggaccacgcc 
ccgagttgag 
accggcccaa 
gcaagggtct 
tgcccgcctt 
tcaccgtcac 
agcccggtgc 
cccatggctc 
gcccaccgac 
ttaaaaaacc 
gttaacttgt 
acaaataaag 
tct 



gcggcctttt 
ttatcccctg 
cgcagccgaa 
cgcaaaccgc 
cccgactgga 
gcaccccagg 
taacaatttc 
attctaccgg 
cccgctgggc 
ggtaggcgcc 
ctagtcagga 
gtagcacgtc 
ggcctttggg 
ggaaggggtg 
aggtcctccg 
tcctcttcct 
atagcataca 
ccggccggtg 
ttccatgacc 
cgtacgcacc 
ggaccgccac 
cgacatcggc 
ggagagcgtc 
cggttcccgg 
ggagcccgcg 
gggcagcgcc 
cctggagacc 
cgccgacgtc 
ctgacgcccg 
cgaccgaagc 
tctagaggat 
tcccacacct 
ttattgcagc 
catttttttc 



tacggttcct 

attctgtgga 

cgaccgagcg 

ctctccccgc 

aagcgggcag 

ctttacactt 

acacaggaaa 

gtaggggagg 

acttggcgct 

aaccggctcc 

agttcccccc 

tcactagtct 

gcagcggcca 

ggtccggggg 

gaggcccggc 

catctccggg 

ttatacgaag 

ccgccaccat 

gagtacaagc 

ctcgccgccg 

atcgagcggg 

aaggtgtggg 

gaagcggggg 

ctggecgcgc 

tggttcctgg 

gtcgtgctcc 

tccgcgcccc 

gagtgcccga 

ccccacgacc 

cgacccgggc 

cataatcagc 

ccccctgaac 

ttataatggt 

actgcattct 



ggccttttgc 

taaccgtatt 

cagcgagtca 

gcgttggccg 

tgagcgcaac 

tatgcttccg 

cagctatgac 

cgcttttccc 

acacaagtgg 

gttctttggt 

cgccccgcag 

cgtgcagatg 

atagcagctt 

cgggctcagg 

attctgcacg 

cctttcgacc 

ttataagctt 

cccctgaccc 

ccacggtgcg 

cgttcgccga 

tcaccgagct 

tcgcggacga 

cggtgttcgc 

ageaacagat 

ccaccgtcgg 

ccggagtgga 

gcaacctccc 

aggaccgcgc 

cgcagcgccc 

ggccccgccg 

cataccacat 

ctgaaacata 

tacaaataaa 

agttgtggtt 



tggccttttg 

accgcctttg 

gtgagcgagg 

attcattaat 

gcaattaatg 

gctcgtatgt 

catgattacg 

aaggcagtct 

cctctggctc 

ggccccttcg 

ctcgcgtcgt 

gacagcaccg 

tgctccttcg 

ggcgggctca 

cttcaaaagc 

tgcagcccgg 

gcatgcctgc 

acgcccctga 

cctcgccacc 

ctaccccgcc 

gcaagaactc 

cggcgccgcg 

cgagatcggc 

ggaaggcctc 

cgtctcgccc 

ggcggccgag 

cttctacgag 

gacctggtgc 

gaccgaaagg 

accccgcacc 

ttgtagaggt 

aaatgaatgc 

gcaatagcat 

tgtccaaact 



ctcacatgtt 
agtgagctga 
aagcggaaga 
gcagctggca 
tgagttagct 
tgtgtggaat 
ccaagctggc 
ggagcatgcg 
gcacacattc 
cgccaccttc 
gcaggacgtg 
ctgagcaatg 
ctttctgggc 
ggggcggggc 
gcacgtctgc 
tacagttcga 
aggtcggccg 
cccctcacaa 
cgcgacgacg 
acgcgccaca 
ttcctcacgc 
gtggcggtct 
ccgcgcatgg 
ctggcgccgc 
gaccaccagg 
cgcgccgggg 
cggctcggct 
atgacccgca 
agcgcacgac 
cgcccccgag 
tttacttgct 
aattgttgtt 
cacaaatttc 
catcaatgta 



5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7803 
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<210> 83 
<211> 8167 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pPGKattA2 



<400> 83 

tatcatgtct 

tataaagtgt 

cagccaggat 

ccgtatatta 

tcttgtatat 

tataccataa 

tcggatccat 

cgagtagctt 

cccaacttaa 

•cccgcaccga 

ggtttccggc 

atactgtcgt 

acgtaaccta 

gttactcgct 

tttttgatgg 

gccaggacag 

accgcctcgc 

tgtggcggat 

tcagcgattt 

ctgaagttca 



ggatccgcgt 
ctaacagttt 
agagcactgg 
ctttttgatt 
tggtaactct 
atagctaagt 
aacttcgtat 
ggcactggcc 
tcgccttgca 
tcgcccttcc 
accagaagcg 
cgtcccctca 
tcccattacg 
cacatttaat 
cgttaactcg 
tcgtttgccg 
ggtgatggtg 
gagcggcatt 
ccatgttgcc 
gatgtgcggc 



taacacctaa 
aaaatatccg 
cctccggagc 
cagattagat 
ctactctata 
ttgtcaaagt 
agcatacatt 
gtcgttttac 
gcacatcccc 
caacagttgc 
gtgccggaaa 
aactggcaga 
gtcaatccgc 
gttgatgaaa 
gcgtttcatc 
tctgaatttg 
ctgcgttgga 
ttccgtgacg 
actcgcttta 
gagttgcgtg 



gaaggcgaag 
atgaggcata 
cggaggtccc 
ttgtaaatct 
atttttatga 
tcttattaaa 
atacgaagtt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
gctggctgga 
tgcacggtta 
cgtttgttcc 
gctggctaca 
tgtggtgcaa 
acctgagcgc 
gtgacggcag 
tctcgttgct 
atgatgattt 
actacctacg 



ttttccttac 
tttatgttgg 
gggttcaaat 
ttattacaag 
gaaattcaca 
ctctccatgt 
ataccgggcc 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
gtgcgatctt 
cgatgcgecc 
cacggagaat 
ggaaggccag 
cgggcgctgg 
atttttacgc 
ttatctggaa 
gcataaaccg 
cagccgcgct 
ggtaacagtt 



accttgcaga 
acccgtagct 
cccggcgggt 
gataatttga 
gtcgtccctt 
agagattaaa 
accatggtcg 
cctggcgtta 
agcgaagagg 
cgctttgcct 
cctgaggccg 
atctacacca 
ccgacgggtt 
acgcgaatta 
gtcggttacg 
gccggagaaa 
gatcaggata 
actacacaaa 
gtactggagg 
tctttatggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



i Oi[:i! ; ii ! «* o «3 l m ;t i. 



agggtgaaac 
gtggtggtta 



gcg'ccgaaat 
tgattgaagc 
tgctgctgaa 
tgcatggtca 
acaactttaa 
gcgaccgcta 
caatgaatcg 
gaatggtgca 
caggccacgg 
gcccggtgca 
tgtacgcgcg 
aatggctttc 
tgggtaacag 
tacagggcgg 
gcaacccgtg 
gtatgaacgg 
accagcagca 
acctgttccg 
cgctggcaag 
tgcctgaact 
aaccgaacgc 
tggcggaaaa 
ccagcgaaat 
caggctttct 
atcagttcac 
accctaacgc 
tgttgcagtg 
ggcagcatca 
gtcaaatggc 
ttggcctgaa 
cgcaagaaaa 
tgtcagacat 
gcgaattgaa 
acagtcaaca 
catggctgaa 
cagtatcggc 
aaaaataata 
acataattgg 
agtgtataat 
ggaactgatg 
tgagtttgga 
tgatgctatt 
ttgcattcat 
aaacctctac 
cctattttta 
tcggggaaat 
tccgctcatg 
gagtattcaa 
ttttgctcac 
agtgggttac 
agaacgtttt 
tattgacgcc 
tgagtactca 
cagtgctgcc 
aggaecgaag 
tcgttgggaa 
tgtagcaatg 
ccggcaacaa 
ggcccttccg 
cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 



gcaggtcgcc 
tgccgatcgc 
cccgaatctc 
agaagcctgc 
cggcaagccg 
ggtcatggat 
cgccgtgcgc 
cggcctgtat 
tctgaccgat 
gcgcgatcgt 
cgctaatcac 
gtatgaaggc 
cgtggatgaa 
gctacctgga 
tcttggcggt 
cttcgtctgg 
gtcggcttac 
tctggtcttt 
gtttttccag 
tcatagcgat 
cggtgaagtg 
accgcagocg 
gaccgcatgg 
cctcagtgtg 
ggatttttgc 
ttcacagatg 
ccgtgcaccg 
ctgggtcgaa 
cacggcagat 
ggggaaaacc 
gattaccgtt 
ctgccagctg 
ctatcccgac 
gtataccccg 
ttatggccca 
gcaactgatg 
tatcgacggt 
ggaattccag 
ataaccgggc 
acaaactacc 
gtgttaaact 
aatgggagca 
caaaccacaa 
gctttatttg 
tttatgtttc 
aaatgtggta 
taggttaatg 
gtgcgcggaa 
agacaataac 
catttccgtg 
ccagaaa'cgc 
atcgaactgg 
ccaatgatga 
gggcaagagc 
ccagtcacag 
ataaccatga 
gagctaaccg 
ccggagctga 
gcaacaacgt 
ttaatagact 
gctggctggt 
gcagcactgg 
caggcaacta 
cattggtaac 
ttttaattta 
taacgtgagt 
tgagatcctt 



agcggcaccg 
gtcacactac 
tatcgtgcgg 
gatgtcggtt 
ttgctgattc 
gagcagacga 
tgttcgcatt 
gtggtggatg 
gatccgcgct 
aatcacccga 
gacgcgetgt 
ggcggagccg 
gaccagccct 
gagacgcgcc 
ttcgctaaat 
gactgggt'gg 
ggcggtgatt 
gccgaccgca 
ttccgtttat 
aacgagctcc 
cctctggatg 
gagagcgccg 
tcagaagccg 
acgctccccg 
atcgagctgg 
tggattggcg 
ctggataacg 
cgctggaagg 
acacttgctg 
ttatttatca 
gatgttgaag 
gcgcaggtag 
cgccttactg 
tacgtcttcc 
caccagtggc 
gaaaccagcc 
ttccatatgg 
ctgagcgccg 
aggggggatc 
tacagagatt 
actgattcta 
gtggtggaat 
ctagaatgca 
taaccattat 
aggttcaggg 
tggctgatta 
tcatgataat 
cccctatttg 
cctgataaat 
tcgcccttat 
tggtgaaagt 
atctcaacag 
gcacttttaa 
aactcggtcg 
aaaagcatct 
gtgataacac 
cttttttgca 
atgaagccat 
tgcgcaaact 
ggatggaggc 
ttattgctga 
ggccagatgg 
tggatgaacg 
tgtcagacca 
aaaggatcta 
tttcgttcca 
tttttctgcg 



cgcctttcgg 
gtctgaacgt 

tggttgaact 
tccgcgaggt 
gaggcgttaa 
tggtgcagga 



atccgaacca 
aagccaatat 
ggctaccggc 
gtgtgatcat 
atcgctggat 
acaccacggc 
tcccggctgt 
cgctgatcct 
actggcaggc 
atcagtcgct 
ttggcgatac 
cgccgcatcc 
ccgggcaaac 
tgcactggat 
tcgctccaca 
ggcaactctg 
ggcacatcag 
ccgcgtccca 
gtaataagcg 
ataaaaaaca 
acattggcgt 
cggcgggcca 
atgcggtgct 
gccggaaaac 
tggcgagcga 
cagagcgggt 
ccgcctgttt 
cgagcgaaaa 
gcggcgactt 
atcgccatct 
ggattggtgg 
gtcgctacca 
tttgtgaagg 
taaagctcta 
attgtttgtg 
gccagatcca 
gtgaaaaaaa 
aagctgcaat 
ggaggtgtgg 
tgatctgcgg 
aatggtttct 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 



cggtgaaatt 
cgaaaacccg 
gcacaccgcc 
gcggattgaa 
ccgtcacgag 
tatcctgctg 
tccgctgtgg 
tgaaacccac 
gatgagcgaa 
ctggtcgctg 
caaatctgtc 
caccgatatt 
gccgaaatgg 
ttgcgaatac 
gtttcgtcag 
gattaaatat 
gccgaacgat 
agcgctgacg 
catcgaagtg 
ggtggcgctg 
aggtaaacag 
gctcacagta 
cgcctggcag 
cgccatcccg 
ttggcaattt 
actgctgacg 
aagtgaagcg 
ttaccaggcc 
gattacgacc 
ctaccggatt 
tacaccgcat 
aaactggctc 
tgaccgctgg 
cggtctgcgc 
ccagttcaac 
gctgcacgcg 
cgacgactcc 
ttaccagttg 
aaccttactt 
aggtaaatat 
tattttagat 
gacatgataa 
tgctttattt 
aaacaagtta 
gaggtttttt 
ccgcagggcc 
tagacgtcag 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 



atcgatgagc 
aaactgtgga 
gacggcacgc 
aatggtctgc 
catcatcctc 
atgaagcaga 
tacacgctgt 
ggcatggtgc 
cgcgtaacgc 
gggaatgaat 
gatccttecc 
atttgcccga 
tccatcaaaa 
gcccacgcga 
tatccccgtt 
gatgaaaacg 
cgccagttct 
gaagcaaaac 
accagcgaat 
gatggtaagc 
ttgattgaac 
cgcgtagtgc 
cagtggcgt'c 
catctgacca 
aaccgccagt 
ccgctgcgcg 
acccgcattg 
gaagcagcgt 
gctcacgcgt 
gatggtagtg 
ccggcgcgga 
ggattagggc 
gatctgccat 
tgcgggacgc 
atcagccgct 
gaagaaggca 
tggagcccgt 
gtctggtgtc 
ctgtggtgtg 
aaaattttta 
tccaacctat 
gatacattga 
gtgaaatttg 
acaacaacaa 
aaagcaagta 
tcgtgatacg 
gtggcacttt 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
324 0 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
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accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 5280 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 534 0 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 5400 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 54 60 
5 accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 5520 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 5580 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 5640 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 5700 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 5760 

10 cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 5820 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 5880 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 5940 
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 6000 
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 6060 

15 cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 6120 
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagctggc 6180 
gcgtcgagga attctaccgg gtaggggagg cgcttttccc aaggcagtct ggagcatgcg 6240 
ctttagcagc cccgctgggc acttggcgct acacaagtgg cctctggctc gcacacattc 6300 

. i cacatccacc ggtaggcgcc aaccggctcc gttctttggt ggccccttcg cgccaccttc 6360 
) tactcctccc ctagtcagga agttcccccc cgccccgcag ctcgcgtcgt gcaggacgtg 6420 
acaaatggaa gtagcacgtc tcactagtct cgtgcagatg gacagcaccg ctgagcaatg 64 80 
,' gaagcgggta ggcctttggg gcagcggcca atagcagctt tgctccttcg ctttctgggc 6540 
tcagaggctg ggaaggggtg ggtccggggg cgggctcagg ggcgggctca ggggcggggc 6600 
gggcgcccga aggtcctccg gaggcccggc attctgcacg cttcaaaagc gcacgtctgc 6660 
5 cgcgctgttc tcctcttcct catctccggg cctttcgacc tgcagcccgg tacagttcga 6720 
aggatccgcg ttaacaccta agaaggcgaa gttttcctta caecttgcag atataaagtg 6780 
tctaacagtt taaaatatcc gatgaggcat atttatgttg gacccgtagc tcagccagga 684 0 
tagagcactg gcctccggag ccggaggtcc cgggttcaaa tcccggcggg tccgtatatt 6900 
actttttgat tcagattaga tttgtaaatc tttattacaa ggataatttg atcttgtata 6960 
)0 ttggtaactc tctactctat aatttttatg agaaattcac agtcgtccct ttataccata 7020 
aatagctaag tttgtcaaag ttcttattaa actctccatg tagagattaa atcggatcct 7080 
tcgaataact tcgtatagca tacattatac gaagttataa gcttgcatgc ctgcaggtcg 714 0 
gccgccacga ccggccggcc ggtgccgcca ccatcccctg acccacgccc ctgacccctc 7200 
acaaggagac gaccttccat gaccgagtac aagcccacgg tgcgcctcgc cacccgcgac 7260 

35 gacgtccccc gggccgtacg caccctcgcc gccgcgttcg ccgactaccc cgccacgcgc 7320 
cacaccgtcg acccggaccg ccacatcgag cgggtcaccg agctgcaaga actcttcctc 7380 
acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg acgacggcgc cgcggtggcg 7440 
gtctggacca cgccggagag cgtcgaagcg ggggcggtgt tcgccgagat cggcccgcgc 7500 
atggccgagt tgagcggttc ccggctggcc gcgcagcaac agatggaagg cctcctggcg 7560 

40 ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg tcggcgtctc gcccgaccac 7620 
cagggcaagg gtctgggcag cgccgtcgtg ctccccggag tggaggcggc cgagcgcgcc 7 680 
ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc tccccttcta cgagcggctc 7740 
ggcttcaccg tcaccgccga cgtcgagtgc ccgaaggacc gcgcgacctg gtgcatgacc 7800 
cgcaagcccg gtgcctgacg cccgccccac gacccgcagc gcccgaccga aaggagcgca 7860 

45 cgaccccatg gctccgaccg aagccgaccc gggcggcccc gccgaccccg cacccgcccc 7920 
cgaggcccac cgactctaga ggatcataat cagccatacc acatttgtag aggttttact 7980 
tgctttaaaa aacctcccac acctccccct gaacctgaaa cataaaatga atgeaattgt 8040 
tgttgttaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 8100 
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 8160 

50 tgtatct " " 8167 

<210> 84 
<211> 51 
55 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer XisAl 



60 



<400> 84 

ataagaatgc ggccgcccga tatgcaaaat cagggtcaag acaaatatca a 51 



65 . <210> 85 
<211> 76 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer XisA2 

<400> 85 

ataagaatgc ggccgcacca 
caaatatcaa caagcc 

10 

<210> 86 
<211> 44 
<212> DNA 
<213> Artificial Sequence 

15 

<220> 

<223> Description of Artificial Sequence: primer XisA3 
<400> 86 

D ataagaatgc ggccgctcaa ctattcttat aagctatttc catc 44 



1 <210> 87 

i , <211> 82 

5 <212> DNA 

! <213> Artificial Sequence 

i <220> 

i | <223> Description of Artificial Sequence: primer nifDl 
0 

<400> 87 

cgatggctct tcccttccgt caaatgcact cttgggatta ctccgaacct agcgatgggg 60 
tgcaaatgtc agatcagata ag 82 

i '5 

; <210> 88 

<211> 82 
<212> DNA 
L <213> Artificial Sequence 

40 

<220> 

<223> Description of Artificial Sequence: primer nifD2 
<400> 88 

45 cgcttatctg atctgacatt tgcaccccat cgctaggttc ggagtaatcc caagagtgca 60 
tttgacggaa gggaagagcc at 82 



<210> 89 
50 <211> 74 
<212> DNA 
<213> Artificial Sequence 



tgcccaagaa gaagaggaag gtgcaaaatc agggtcaaga 60 

76 



<220> 

55 <223> Description of Artificial Sequence: primer nifD3 
<400> 89 

gatcagctgt tgaaagctat taaaccacaa aaaggattac tccggccctt atcacggtta 60 
cgacggattt gcta 74 



<210> 90 
<211> 74 
<212> DNA 
65 <213> Artificial 



Sequence 



<220> 
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<223> Description of Artificial Sequence: primer nifD4 

<400> 90 

gatctagcaa atccgtcgta accgtgataa gggccggagt aatccttttt gtggtttaat 60 
5 agctttcaac agct 74 



<210> 91 
<211> 41 
10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primerSSVl-1 
<400> 91 

ataagaatgc ggccgcccga tatgacgaaa gataagacgc g 41 

) <210> 92 

<211> 70 
; <212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> Description of Artificial Sequence: primer SSV1-2 



<400> 92 

ataagaatgc ggccgcacca tgcccaagaa gaagaggaag gtgacgaaag ataagacgcg 60 
50 ttataaatac " ~ 70 



,' <210> 93 

i <211> 47 

35 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer SSV2 



45 <210> 94 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: primer SSV3 
<400> 94 

^ gagcccggga caagcggaag cggtggtgga aaagagggaa ctgaacg 47 

<210> 95 
<211> 33 
<212> DNA 
60 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer SSV4 
65. <400> 95 

atcgctcgag tcagacccct tttagccatt ccg 33 



; 40 



<400> 93 



tgtcccgggc tcgaaaccgg ggggatccgc ttgtagggga gtatccc 



47 
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<210> 96 
<211> 40 
<212> DNA 
5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer SSV5 

10 <400> 96 

atcgttcgaa ggatccgcgt taacacctaa gaaggcgaag 



40 



<210> 97 
15 <211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

) <223> Description of Artificial Sequence: primer SSV6 
<400> 97 

atcgttcgaa ggatccgatt taatctctac atggagag 



38 



10 



15 



<210> 98 
<211> 64 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer C31-2 
<400> 98 

at'aagaatgc ggccgcacca tgcccaagaa gaagaggaag gtgacacaag gggttgtqac 60 



<210> 99 
40 <211> 4831 
<212> DNA 

<213> Artificial Sequence*' " 
<220> 

45 <223> Description of Artificial Sequence: vect 



50 



55 



60 



65 



or pRK41 



<400> 99 

ggccgcacca 

acgtacgcgg 

ccagcgacac 

cgcgacgggg 

ttcgggacgg 

ctcaacatga 

attccgattg 

gtcttccggc 

cacaaagaat 

ttgggcgggt 

gagatcacgc 

actcccctta 

atcaagacgc 

agcatcacgg 

attgggaaga 

gacccgcgta 

ccgaccacga 

gagcttgatt 

gacggcaggg 

aagctgtact 



tgcccaagaa 
gtgcttacga 
agcgtagcgc 
gccggttcag 
cggagcgccc 
tcattgtcta 
tctcggaatt 
agggaaacgt 
cttcgctgaa 
acgtcggcgg 
gcaacggccg 
ccggaccctt 
acaaacacct 
ggctttgtaa 
agaccgcttc 
ttgcgggett 
agattgaggg 
gcggaccgat 
ggcgcggcaa 
gcgagtgtgg 



gaagaggaag 
ccgtcagtcg 
caacgaagac 
gttcgtcggg 
ggagttcgaa 
tgacgtgtcg 
gctcgccctg 
catggacctg 
gtcggcgaag 
gaaggcgcct 
aatggtcaat 
cgagttcgag 
tcccttcaag 
gcgcatggac 
aagcgcctgg 
cgccgctgag 
ttaccgcatt 
catcgagccc 
ggggctttcc 
cgccgtcatg 



gtgacacaag 

cgcgagcgcg 

aaggcggccg 

catttcagcg 

cgcatcctga 

cgcttctcgc 

ggcgtgacga 

attcacctga 

attctcgaca 

tacggcttcg 

gtcgtcatca. 

cccgacgtaa 

ccgggcagtc 

gctgacgccg 

gacccggcaa 

gtgatctaca 

cagcgcgacc 

gctgagtggt 

egggggcaag 

acttcgaagc 



gggttgtgac 
agaattcgag 
accttcagcg 
aagcgccggg 
acgaatgccg 
gcctgaaggt 
.ttgtttccac 
ttatgcggct 
cgaagaacct 
agcttgtttc 
acaagcttgc 
tccggtggtg 
aagccgccat 
tgccgacccg 
ccgttatgcg 
agaagaagcc 
cgatcacgct 
atgagcttca 
ccattctgtc 
gcggggaaga 



cggggtggac 
cgcagcaagc 
cgaagtcgag 
cacgtcggcg 
cgccgggcgg 
catggacgcg 
tcaggaaggc 
cgacgcgtcg 
tcagcgcgaa 
ggagacgaag 
gcactcgacc 
gtggcgtgag 
tcacccgggc 
gggcgagacg 
aatccttcgg 
ggacggcacg 
ccggccggtc 
ggcgtggttg 
cgccatggac 
atcgatcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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gactcttacc gctgccgtcg ccggaaggtg gtcgacccgt ccgcacctgg gcagcacgaa 1260 
ggcacgtgca acgtcagcat ggcggcactc gacaagttcg ttgcggaacg catcttcaac 1320 
aagatcaggc acgccgaagg cgacgaagag acgttggcgc ttctgtggga agccgcccga 1380 
cgcttcggca agctcactga ggcgcctgag aagagcggcg aacgggcgaa ccttgttgcg 14 40 
5 gagcgcgccg acgccctgaa cgcccttgaa gagctgtacg aagaccgcgc ggcaggcgcg 1500 
tacgacggac ccgttggcag gaagcacttc cggaagcaac aggcagcgct gacgctccgg 1560 
cagcaagggg cggaagagcg gcttgccgaa cttgaagccg ccgaagcccc gaagcttccc 1620 
cttgaccaat ggttccccga agacgccgac gctgacccga ccggccctaa gtcgtggtgg 1680 
gggcgcgcgt cagtagacga caagcgcgtg ttcgtcgggc tcttcgtaga caagatcgtt 1740 
10 gtcacgaagt cgactacggg cagggggcag ggaacgccca tcgagaagcg cgcttcgatc 1800 
acgtgggcga agccgccgac cgacgacgac gaagacgacg cccaggacgg cacggaagac 18 60 
gtagcggcgt agcggccgct ctagaactag tggatccccc gggctgcagg aattcgatat 1920 
caagcttatc gataccgtcg acctcgaggg ggggcccggt acccagcttt tgttcccttt 1980 
agtgagggtt aatttcgagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 2040 
15 gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg 2100 
gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 2160 
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 2220 
tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 2280 
I tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 2340 
) ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 24 00 
| ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 24 60 
.gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 2520 
| gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 2580 
j ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 2640 
5 tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 2700 
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 2760 
' tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 2820 
j tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 2880 
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 2940 

0 ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 3000 
j ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 30 60 

1 gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 3120 
I aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 3180 
|. aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 324 0 
55 cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 3300 

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 3360 

, cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 34 20 
ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 34 80 
ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 3540 

40 ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 3600 
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 3660 
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 3720 
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 37 80 
gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 3840 

45 ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 3900 
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 3960 
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 4020 
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 4080 
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 4140 

50 gcacatttcc ccgaaaagtg ccacctaaat tgtaagcgtt aatattttgt taaaattcgc 4200 
gttaaatttt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc 42 60 
ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag 4320 
tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga 4380 
tggcccacta cgtgaaccat caccctaatc aagttttttg gggtcgaggt gccgtaaagc 4440. 

55 actaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa 4500 
cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt 4560 
agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aatgcgccgc tacagggcgc 4 620 
gtcccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc 4680 
gctattacgc cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc 47 4 0 

60 agggttttcc cagtcacgac gttgtaaaac gacggccagt gaattgtaat acgactcact 4800 
atagggcgaa ttggagctcc accgcggtgg c 4831 



<210> 100 
65. <211> 20 
<212> DNA 

<213> Artificial Seguence 



0^ 126 



i! !,'!,''!! 1 .Mi !! 1 i!" !' ... !i 1 i 1 '"'IS . 



5 

10 
15 

r ■ 

! i 

! 5 
I 

io 

j . , 
35 

i 

! 

! 

40 
45 
50 
55 
60 
65 



<220> 

<223> Description of Artificial Sequence: primer 
C31-screen 1 

<400> 100 

gcgtgagatc aagacgcaca 20 



<210> 101 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
C31-screen 2 

<400> 101 

gcagcgqtaa gagtccttga t 21 



<210> 102 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
beta-Gal 3 



<400> 102 

atcctctgca tggtcaggtc 20 

<210> 103 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
beta-Gal 4 

<400> 103 

cgtggcctga ttcattcc 18 



<210> 104 
<211> 5878 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
pCAGGS-Cre-pA 



vector 



<400> 104 

cgccgcgtgc 

tgtgcgctcc 

gggctgcgag 

tgtgggcgcg 

acggcccggc 

gcggggggtg 

gctcggggga 

cagccattgc 

ctggcggagc 



ggcccgcgct 
gcgtgtgcgc 
gggaacaaag 
gcggtcgggc 
ttcgggtgcg 
gcggcaggtg 
ggggcgcggc 
cttttatggt 
cgaaatctgg 



gcccggcggc 
gaggggagcg 
gctgcgtgcg 
tgtaaccccc 
gggctccgtg 
ggggtgccgg 
ggccccggag 
aatcgtgcga 
gaggcgccgc 



tgtgagcgct 
cggccggggg 
gggtgtgtgc 
ccctgcaccc 
cggggcgtgg 
gcggggcggg 
cgccggcggc 
gagggcgcag 
cgcaccccct 



gcgggcgcgg 
cggtgccccg 
gtgggggggt 
ccctccccga 
cgcggggctc 
gccgcctcgg 
tgtcgaggcg 
ggacttcctt 
ctagcgggcg 



cgcggggctt 60 
cggtgcgggg 120 
gagcaggggg 180 
gttgctgagc 240 
gccgtgccgg 300 
gccggggagg 360 
cggcgagccg 420 
tgtcccaaat 480 
cgggcgaagc 540 



127 



mm- ■ 



ggtgcggcgc 



10 



15 



0 



40 



45 



50 



55 



60 



65 



gtccccttct 
acggggcagg 
tgagtactcc 
cgaggaggat 
ctggtcagaa 
gccatacact 
cagggcggcc 
ccaaaatttg 
ggacatgttc 
ttgccggtcg 
tgaagatgtt 
ccagcaacat 
tgacagcaat 
tgaacgtgca 
catggaaaat 
taacaccctg 
tgacggtggg 
tgtagagaag 
tggtgtagct 
cgcgccatct 
tcatcgattg 
acacagtgcc 
gatcatgcaa 
ggatagtgaa 
taaatgattg 
caataaaaga 
gggggaggcc 
agggggaggc 
accgagctcg 
tacccaactt 
ggcccgcacc 
gcggtatttt 
tacaatctgc 
cgcgccctga 
cgggagctgc 
cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 



cggcaggaag 
ccatctccag 
gcggggttcg 
ctctcaaaag 
ttgatattca 
aagacaatct 
tgagtgacat 
tcgaccatgc 
cctgcattac 
agggatcgcc 
tgggcggcat 
cgcgattatc 
ttgggccagc 
gctgtttcac 
aaacaggctc 
agcgatcgct 
ttacgtatag 
agaatgttaa 
gcacttagcc 
gatgatccga 
gccaccagcc 
atttacggcg 
cgtgtcggag 
gctggtggct 
acaggggcaa 
cagatccact 
tcattatttt 
agaatgaggc 
cagaatgacc 
aattcactgg 
aatcgccttg 
gatcgccctt 
ctccttacgc 
"tctgatgccg 
cgggcttgtc 
atgtgtcaga 
cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgctc 
_ cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
rtcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 



gaaatgggcg 
cctcggggct 
gcttctggcg 
cgggcatgac 
cctggcccgc 
ttttgttgtc 
tgacatccac 
ccaagaagaa 
cggtcgatgc 
aggcgttttc 
ggtgcaagtt 
ttctatatct 
taaacatgct 
tggttatgcg 
tagcgttcga 
gccaggatat 
ccgaaattgc 
tccatattgg 
tgggggtaac 
ataactacct 
agctatcaac 
ctaaggatga 
ccgcgcgaga 
ggaccaatgt 
tggtgcgcct 
agttctaggg 
caatagatct 
gcggccaagg 

ttgggggagg 

ccgtcgtttt 
cagcacatcc 
cccaacagtt 
atctgtgcgg 
catagttaag 
tgctcccggc 
ggttttcacc 
tataggttaa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttccaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 



gggagggcct 
gccgcagggg 
tgtgaccggc 
ttctgcgcta 
ggtgatgcct 
aagcttgagg 
tttgcctttc 
gaggaaggtg 
aacgagtgat 
tgagcatacc 
gaataaccgg 
tcaggcgcgc 
tcatcgtcgg 
gcggatccga 
acgcactgat 
acgtaatctg 
caggatcagg 
cagaacgaaa 
taaactggtc 
gttttgccgg 
tcgcgccctg 
ctctggtcag 
tatggcccgc 
aaatattgtc 
gctggaagat 
ccgcgtcgac 
gtgtgttggt 
gggaggggga 
gggaggccag 
acaacgtcgt 
ccctttcgcc 
gcgcagcctg 
tatttcacac 
ccagccccga 
atccgcttac 
gtcatcaccg 
tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 
tatggatgaa 
actgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgataagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 



tcgtgcgtcg 
gacggctgcc 
ggctctagta 
agattgtcag 
ttgagggtgg 
tgtggcaggc 
tctccacagg 
tccaatttac 
gaggttcgca 
tggaaaatgc 
aaatggtttc 
ggtctggcag 
tccgggctgc 
aaagaaaacg 
ttcgaccagg 
gcatttctgg 
gttaaagata 
acgctggtta 
gagcgatgga 
gtcagaaaaa 
gaagggattt 
agatacctgg 
gctggagttt 
atgaactata 
ggcgattagc 
ctcgagatcc 
tttttgtgtg 
ggccagaatg 
aatgaggcgc 
gactgggaaa 
agctggcgta 
aatggcgaat 
cgcatatggt 
cacccgccaa 
agacaagctg 
aaacgcgcga 
ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 



ccgcgccgcc 
ttcggggggg 
agcgttgggg 
tttccaaaaa 
ccgcgtccat 
ttgagatctg 
tgtccactcc 
tgaccgtaca 
agaacctgat 
ttctgtccgt 
ccgcagaacc 
taaaaactat 
cacgaccaag 
ttgatgccgg 
ttcgttcact 
ggattgctta 
tctcacgtac 
gcaccgcagg 
tttccgtctc 
atggtgttgc 
ttgaagcaac 
cctggtctgg 
caataccgga 
tccgtaacct 
cattaacgcg 
aggcgcggat 
ccttggggga 
accttggggg 
gcccccgggt 
accctggcgt 
atagcgaaga 
ggcgcctgat 
gcactctcag 
cacccgctga 
tgaccgtctc 
gacgaaaggg 
cttagacgtc 
tctaaataca 
aatattgaaa 
ttgcggcatt 
ctgaagatca 
tccttgagag 
tatgtggcgc 
actattctca 
gcatgacagt 
acttacttct 
gggatcatgt 
acgagcgtga 
gcgaactact 
ttgcaggacc 
gagccggtga 
cccgtatcgt 
agatcgctga 
catatatact 
tcctttttga 
cagaccccgt 
gctgcttgca 
taccaactct 
ttctagtgta 
tcgctctgct 
ggttggactc 
cgtgcacaca 
agctatgaga 
gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
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tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4620 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4 680 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 4740 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 4800 

5 tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 4860 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 4920 
cgccaagcta gcccgggcta gcttgcatgc ctgcaggttt tcgacattga ttattgacta 4980 
gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg 5040 
ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga 5100 
10 cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat 5160 
gggtggacta tttacggtaa actgcccact tggcagtaea tcaagtgtat catatgccaa 5220 
gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca 5280 
tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc gctattacca 5340 
tgggtcgagg tgagccccac gttctgcttc actctcccca tctccccccc ctccccaccc 5400 
15 ccaattttgt atttatttat tttttaatta ttttgtgcag cgatgggggc gggggggggg 54 60 
ggggcgcgcg ccaggcgggg cggggcgggg cgaggggcgg ggcggggcga ggcggagagg 5520 
tgcggcggca gccaatcaga gcggcgcgct ccgaaagttt ccttttatgg cgaggcggcg 5580 
r gcggcggcgg ccctataaaa agcgaagcgc gcggcgggcg ggagtegctg cgttgccttc 5640 

! gccccgtgcc ccgctccgcg ccgcctcgcg ccgcccgccc cggctctgac tgaccgcgtt 5700 

3 actcccacag gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agcgcttggt 57 60 
ttaatgacgg ctcgtttctt ttctgtggct gcgtgaaagc cttaaagggc tccgggaggg 5820 

' ccctttgtgc gggggggagc ggctcggggg gtgcgtgcgt gtgtgtgtgc gtggggag 5878 

I 

5 <210> 105 
<211> 6641 
<212> DNA 

<213> Artificial Sequence 
}0 <220> 

<223> Description of Artificial Sequence: vector 
pCAGGSC31CNLS-pA 

<4 00> 105 

35 attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 60 

atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 120 

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 180 

tccattgacg tcaatgggtg gactatttac ggtaaactgc ccacttggca gtacatcaag 240 

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 300 

40 attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 360 

tcatcgctat taccatgggt cgaggtgagc cccacgttct gcttcactct ccccatctcc 420 

cccccctcqc cacccccaat tttgtattta tttatttttt aattattttg tgcagcgatg 480 

ggggcggg'gg gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg 540 

ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt 600 

45 tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt 660 

cgctgcgttg ccttcgcccc gtgccccgct ccgcgccgcc tcgcgccgcc cgccccggct 720 

ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc ctccgggctg 780 

taattagcgc ttggtttaat gacggctcgt ttcttttctg tggctgcgtg aaagccttaa 840 

agggctccgg gagggccctt tgtgcggggg ggagcggctc ggggggtgcg tgcgtgtgtg 900 

50 tgtgcgtggg gagcgccgcg tgcggcccgc gctgcccggc ggctgtgagc gctgcgggcg 960 ' 

cggcgcgggg ctttgtgcgc tccgcgtgtg cgcgagggga gcgcggccgg gggcggtgcc 1020 

ccgcggtgcg ggggggctgc gaggggaaca aaggctgcgt gcggggtgtg tgcgtggggg 1080 

ggtgagcagg gggtgtgggc gcggcggtcg ggctgtaacc cccccctgca cccccctccc 1140 

cgagttgctg agcacggccc ggcttcgggt gcggggctcc gtgcggggcg tggcgcgggg 1200 

55 ctcgccgtgc cgggcggggg gtggcggcag gtgggggtgc cgggcggggc ggggccgcct 1260 

cgggccgggg agggctcggg ggaggggcgc ggcggccccg gagcgccggc ggctgtcgag 1320 

gcgcggcgag ccgcagccat tgccttttat ggtaatcgtg cgagagggcg cagggacttc 1380 

ctttgtccca aatetggcgg agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg 1440 

gcgcgggcga agcggtgcgg cgccggcagg aaggaaatgg gcggggaggg ccttcgtgcg 1500 

60 tcgccgcgcc gccgtcccct tctccatctc cagcctcggg gctgccgcag ggggacggct 1560 

gccttcgggg gggacggggc agggcggggt tcggcttctg gcgtgtgacc ggcggctcta 1620 

gtaagcgttg gggtgagtac tccctctcaa aagcgggcat gacttctgcg ctaagattgt 1680 

cagtttccaa aaacgaggag gatttgatat tcacctggcc cgcggtgatg cctttgaggg 1740 

tggccgcgtc catctggtca gaaaagacaa tctttttgtt gtcaagcttg aggtgtggca 1800 

65 ggcttgagat ctggccatac acttgagtga cattgacatc cactttgcct ttctctccac 1860 

aggtgtccac tcccagggcg gccgcccgat atgacacaag gggttgtgac cggggtggac 1920 

acgtacgcgg gtgcttacga ccgtcagtcg cgcgagcgcg agaattcgag cgcagcaagc 1980 
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ccagcgacac agcgtagcgc caacgaagac aaggcggccg accttcagcg cgaagtcgag 2040 
cgcgacgggg gccggttcag gttcgtcggg "catttcagcg aagcgccggg cacgtcggcg 2100 
ttcgggacgg cggagcgccc ggagttcgaa cgcatcctga acgaatgccg cgccgggcgg 2160 
ctcaacatga tcattgtcta tgacgtgtcg cgcttctcgc gcctgaaggt catggacgcg 2220 
5 attccgattg tctcggaatt gctcgccctg ggcgtgacga ttgtttccac tcaggaaggc 2280 
gtcttccggc agggaaacgt catggacctg attcacctga ttatgcggct cgacgcgtcg 2340 
cacaaagaat cttcgctgaa gtcggcgaag attctcgaca cgaagaacct tcagcgcgaa 24 00 
ttgggcgggt acgtcggcgg gaaggcgcct tacggcttcg agcttgtttc ggagacgaag 2460 
gagatcacgc gcaacggccg aatggtcaat gtcgtcatca acaagcttgc gcactcgacc 2520 

10 actcccctta ccggaccctt cgagttcgag cccgacgtaa tccggtggtg gtggcgtgag 2580 
atcaagacgc acaaacacct tcccttcaag ccgggcagtc aagccgccat tcacccgggc 2640 
agcatcacgg ggctttgtaa gcgcatggac gctgacgccg tgccgacccg gggcgagacg 2700 
attgggaaga agaccgcttc aagcgcctgg gacccggcaa ccgttatgcg aatccttcgg 27 60 
gacccgcgta ttgcgggctt cgccgctgag gtgatctaca agaagaagcc ggacggcacg 2820 

15 ccgaccacga agattgaggg ttaccgcatt cagcgcgacc cgatcacgct ccggccggtc 2880 
gagcttgatt gcggaccgat catcgagccc gctgagtggt atgagcttca ggcgtggttg 2940 
gacggcaggg ggcgcggcaa ggggctttcc cgggggcaag ccattctgtc cgccatggac 3000 
aagctgtact gcgagtgtgg cgccgtcatg acttcgaagc gcggggaaga atcgatcaag 3060 
, gactcttacc gctgccgtcg ccggaaggtg gtcgacccgt ccgcacctgg gcagcacgaa 3120 
i ggcacgtgca acgtcagcat ggcggcactc gacaagttcg ttgcggaacg catcttcaac 3180 
i aagatcaggc acgccgaagg cgacgaagag acgttggcgc ttctgtggga agccgcccga 324 0 
cgcttcggca agctcactga ggcgcctgag aagagcggcg aacgggcgaa ccttgttgcg 3300 
gagcgcgccg acgccctgaa cgcccttgaa gagctgtacg aagaccgcgc ggcaggcgcg 3360 
tacgacggac ccgttggcag gaagcacttc cggaagcaac aggcagcgct gacgctccgg 3420 
') cagcaagggg cggaagagcg gcttgccgaa cttgaagccg ccgaagcccc gaagcttccc 3480 
cttgaccaat ggttccccga agacgccgac gctgacccga ccggccctaa gtcgtggtgg 3540 
gggcgcgcgt cagtagacga caagcgcgtg ttcgtcgggc tcttcgtaga caagatcgtt 3600 
gtcacgaagt cgactacggg cagggggcag ggaacgccca tcgagaagcg cgcttcgatc 3660 
acgtgggcga agccgccgac cgacgacgac gaagacgacg cccaggacgg cacggaagac 3720 
0 gtagcggcgc ctaagaagaa gaggaaggtt tagactctcg agatccaggc gcggatcaat 3780 
aaaagatcat tattttcaat agatctgtgt gttggttttt tgtgtgcctt gggggagggg 3840 
gaggccagaa tgaggcgcgg ccaaggggga gggggaggcc agaatgacct tgggggaggg 3900 
ggaggccaga atgaccttgg gggaggggga ggccagaatg aggcgcgccc ccgggtaccg 3960 
/■ agctcgaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc "t'ggcgt'ta'c'c 4020 

55 caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 4080 
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg 414 0 
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca 4200 
atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 4260 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 4320 

40 agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc 4380 
gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt 4440 
ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca 4500 
aatatgtatc cgctcatgag acaat'aaccc tgataaatgc ttcaataata ttgaaaaagg 4560 
aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 4 620 

45 cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 4680 
ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 4740 
cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta 4800 
ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 48 60 
gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 4920 

50 gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca 4980 
acgatcggag "gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact 5040 
cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc 5100 
acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact 5160 
ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt 5220 

55 ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt 5280 
gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt 5340 
atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata 54 00 
ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag 5460 
attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat 5520 

60 ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa 5580 
aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 5640 
aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt 5700 
ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg 5760 
tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc 5820 

65 ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga 5880 
cgatagttae cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc 5940 
agcttggagc gaacgaccta caccgaa'ctg agatacctac agcgtgagct atgagaaagc 6000 
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130 

gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca 6060 
ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg 6120 
tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta 6180 
tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct 6240 
cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag 6300 
tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa 6360 
gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc 6420 
agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg 6480 
agttagctca ctcattaggc accccaggct ttacacttta tgcttccggc tcgtatgttg 6540 
tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc 6600 
aagctagcec gggctagctt gcatgcctgc aggttttcga c 6641 
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ggcaggccct ccgagcgtgg tggagccgtt ctgtgagaca . gccgggtacg agtcgtgacg 60 
> ctggaagggg caagcgggtg gtgggcagga atgcggtccg ccctgcagca accggagggg 120 
gagggagaag ggagcggaaa agtctccacc ggacgcggcc atggctcggg gggggggggg 180 
cagcggagga gcgcttccgg ccgacgtctc gtcgctgatt ggcttctttt cctcccgccg 240 
! tgtgtgaaaa cacaaatggc gtgttttggt tggcgtaagg cgcctgtcag ttaacggcag 300 
ccggagtgcg cagccgccgg cagcctcgct ctgcccactg ggtggggcgg gaggtaggtg 360 
gggtgaggcg agctggacgt gcgggcgcgg tcggcctctg gcggggcggg ggaggggagg 420 
gagggtcagc gaaagtagct cgcgcgcgag cggccgccca ccctcccctt cctctggggg 480 
agtcgtttta cccgccgccg gccgggcctc gtcgtctgat tggctctcgg ggcccagaaa 54 0 
actggccctt gccattggct cgtgttcgtg caagttgagt ccatccgccg gccagcgggg 600 
gcggcgagga ggcgctccca ggttccggcc ctcccctcgg ccccgcgccg cagagtctgg 660 
,35 ccgcgcgecc ctgcgcaacg tggcaggaag cgcgcgctgg gggcggggac gggcagtagg 720 
gctgagcggc tgcggggcgg gtgcaagcac gtttccgact tgagttgcct caagaggggc 780 
gtgctgagcc agacctccat cgcgcactcc ggggagtgga gggaaggagc gagggctcag 840 
ttgggctgtt ttggaggcag gaagcacttg ctctcccaaa gtcgctctga gttgttatca 900 
gtaagggagc tgcagtggag taggcgggga gaaggccgca cccttctccg gaggggggag 960 
gggagtgttg caataccttt ctgggagttc tctgctgcct cctggcttct gaggaccgcc 1020 
ctgggcctgg gagaatccct tccccctctt ccctcgtgat ctgcaactcc agtctttcgc 1080 
ctaggtaacc gatatccctg caggggtgac ctgcacgtct agggcgcagt agtccagggt 1140 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 1200 
aaactcttcg cggtctttcc agtactcctg caggtgactg actgagtcga cgacactgca 1260 
45 gagacctact tcactaacaa ccggtacagt tcgtggacca gatgggtgag gtggagtacg 1320 
cgcccgggga gcccaagggc acgccctggc acccgcaccg cggcttcgag accgtcacga 1380 
ataacttcgt atagcataca ttatacgaag ttataagctc gatgaattct accgggtagg 1440 
ggaggcgctt ttcccaaggc agtctggagc atgcgcttta gcagccccgc tggcacttgg 1500 
cgctacacaa gtggcctctg gcctcgcaca cattccacat ccaccggtag cgccaaccgg 1560 
ctccgttctt tggtggcccc ttcgcgccac cttctactcc tcccctagtc aggaagttcc .1620 
cccccgcccc gcagctcgcg tcgtgcagga cgtgacaaat ggaagtagca cgtctcacta 1680 
gtctcgtgca gatggacagc accgctgagc aatggaagcg ggtaggcctt tggggcagcg 1740 
gccaatagca gctttgctcc ttcgctttct gggctcagag gctgggaagg ggtgggtccg 1800 
ggggcgggct caggggcggg ctcaggggcg gggcgggcgc gaaggtcctc ccgaggcccg 18 60 
gcattctcgc acgcttcaaa agcgcacgtc tgccgcgctg ttctcctctt cctcatctcc 1920 
gggcctttcg acgatccagc cgccaccatg aaaaagcctg aactcaccgc gacgtctgtc 1980 
gagaagtttc tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc 204 0 
gaagaatctc gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat 2100 
agctgcgccg atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg . 2160 
ctcccgattc cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc 2220 
tcccgccgtg cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt 2280 
ctgcagccgg tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc 2340 
gggttcggcc cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata 2400 
tgcgcgattg ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt 24 60 
gcgtccgtcg cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc 2520 
cggcacctcg tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata 2580 
acagcggtca ttgactggag cgaggcgatg ttcggggatt cccaatacga -ggtcgccaac 2640 



131 •'• 



10 



15 



40 



45 



50 



55 



60 



65 



atcttcttct 
aggcatccgg 
gaccaactct 
cgatgcgacg 
agaagcgcgg 
cgccccagca 
acaataaaga 
gaacagagta 
gattagataa 
catagttgga 
cactcatgat 
actttgtggt 
gatcagcagc 
tccatcagaa 
aggttttact 
atgcaattgt 
gcatcacaaa 
aactcatcaa 
aactgagaga 
tccataactt 
agcttggcac 
cttaatcgcc 
accgatcgcc 
ccggcaccag 
gtcgtcgtcc 
acctatccca 
tcgctcacat 
gatggcgtta 
gaeagtcgtt 
ctcgcggtga 
cggatgagcg 
gatttccatg 
gttcagatgt 
gaaacgcagg 
ggttatgccg 
gaaatcccga 
gaagcagaag 
ctgaacggca 
ggtcaggtca 
tttaacgccg 
cgctacggcc 
aatcgtctga 
gtgcagcgcg 
cacggcgcta 
gtgcagtatg 
gcgcgcgtgg 
ctttcgctac 
aacagtcttg 
ggcggcttcg 
ccgtggtcgg 
aacggtctgg 
cagcagtttt 
ttccgtcata 
gcaagcggtg 
gaactaccgc 
aacgcgaccg 
gaaaacctca 
gaaatggatt 
tttctttcac 
ttcacccgtg 
aacgcctggg 
cagtgcacgg 
catcagggga 
atggcgatta 
ctgaactgcc 
gaaaactatc 
gacatgtata 



ggaggccgtg 
agcttgcagg 
atcagagctt 
caatcgtccg 
ccgtctggac 
ctcgtccgag 
tgtccactaa 
cctacatttt 
atgcctgctc 
tatcataatt 
ctatagatct 
tctaagtact 
ctctgttcca 
gcttcagctg 
tgctttaaaa 
tgttgttaac 
tttcacaaat 
tgtatcttat 
actcaaaggt 
cgtatagcat 
tggccgtcgt 
ttgcagcaca 
cttcccaaca 
aagcggtgcc 
cctcaaactg 
ttacggtcaa 
ttaatgttga 
actcggcgtt 
tgccgtctga 
tggtgctgcg 
gcattttccg 
ttgccactcg 
gcggcgagtt 
tcgccagcgg 
atcgcgtcac 
atctctatcg 
cctgcgatgt 
agccgttgct 
tggatgagca 
tgcgctgttc 
tgtatgtggt 
ccgatgatcc 
atcgtaatca 
atcacgacgc 
aaggcggcgg 
atgaagacca 
ctggagagac 
gcggtttcgc 
tctgggactg 
cttacggcgg 
tctttgccga 
tccagttccg 
gcgataacga 
aagtgcctct 
agccggagag 
catggtcaga 
gtgtgacgct 
tttgcatcga 
agatgtggat 
caccgctgga 
tcgaacgctg 
cagatacact 
aaaccttatt 
ccgttgatgt 
agctggcgca 
ccgaccgcct 
ceccgtacgt 



gttggcttgt 
atcgccgcgg 
ggttgacggc 
atccggagcc 
cgatggctgt 
ggcaaaggaa 
aatggaagtt 
gaatggaagg 
tttactgaag 
taaacaagca 
atagatctct 
gtggtttcca 
catacacttc 
ctcgactaga 
aacctcccac 
ttgtttattg 
. aaagcatttt 
catgtctgga 
taccccagtt 
acattatacg 
tttacaacgt 
tccccctttc 
gttgcgcagc 
ggaaagctgg 
gcagatgcac 
tccgccgttt 
tgaaagctgg 
tcatctgtgg 
atttgacctg 
ttggagtgac 
tgacgtctcg 
ctttaatgat 
gcgtgactac 
caccgcgcct 
actacgtctg 
tgcggtggtt 
cggtttccgc 
gattcgaggc 
gacgatggtg 
gcattatccg 
ggatgaagcc 
gcgctggcta 
cccgagtgtg 
gctgtatcgc 
agccgacacc 
gcccttcccg 
gcgcccgctg 
taaatactgg 
ggtggatcag 
tgattttggc 
ccgcacgccg 
tttatccggg 
gctcctgcac 
ggatgtcgct 
cgccgggcaa 
agccgggcac 
ccccgccgcg 
gctgggtaat 
tggcgataaa 
taacgacatt 
gaaggcggcg 
tgctgatgcg 
tatcagccgg 
tgaagtggcg 
ggtagcagag 
tactgccgcc 
cttcccgagc • 



atggagcagc 
ctccgggcgt 
aatttcgatg 
gggactgtcg 
gtagaagtac 
tagtcgatgc 
tttcctgtca 
attggagcta 
gctctttact 
aaaccaaatt 
cgtgggatca 
aatgtgtcag 
attctcagta 
ggatcataat 
acctccccct 
cagcttataa 
tttcactgca 
tccgtgtcat 
ggggcactac 
aagttatacc 
cgtgactggg 
gccagctggc 
ctgaatggcg 
ctggagtgcg 
ggttacgatg 
gttcccacgg 
ctacaggaag 
tgcaacgggc 
agcgcatttt 
ggcagttatc 
ttgctgcata 
gatttcagcc 
ctacgggtaa 
ttcggcggtg 
aacgtcgaaa 
gaactgcaca 
gaggtgcgga 
gttaaccgtc 
caggatatcc 
aaccatccgc 
aatattgaaa 
ccggcgatga 
atcatctggt 
tggatcaaat 
acggccaccg 
gctgtgccga 
atcctttgcg 
caggcgtttc 
tcgctgatta 
gatacgccga 
catccagcgc 
caaaccatcg 
tggatggtgg 
ccacaaggta 
ctctggctca 
atcagcgcct 
tcccacgcca 
aagcgttggc 
aaacaactgc 
ggcgtaagtg 
ggccattacc 
gtgctgatta 
aaaacctacc 
agcgatacac 
cgggtaaact 
tgttttgacc 
gaaaacggtc 



agacgcgcta 
atatgctccg 
atgcagcttg 
ggcgtacaca 
tcgccgatag 
agaaattgat 
tactttgtta 
cgggggtggg 
attgctttat 
aagggccagc 
ttgtttttct 
tttGatagcc 
ttgttttgcc 
cagccatacc 
gaacctgaaa 
tggttacaaa 
ttetagttgt 
gtcggcgacc 
tcccgaaaac 
gggccaccat 
aaaaccctgg 
gtaatagcga 
aatggcgctt 
atcttcctga 
cgcccatcta 
agaatccgac 
gceagacgeg 
gctgggtcgg 
tacgcgccgg 
tggaagatca 
aaccgactac 
gcgctgtact 
cagtttcttt 
aaattatcga 
acccgaaact 
ccgccgacgg 
ttgaaaatgg 
acgagcatca 
tgctgatgaa 
tgtggtacac 
cccacggcat 
gcgaacgcgt 
cgctggggaa 
ctgtcgatcc 
atattatttg 
aatggtccat 
aatacgccca 
gtcagtatcc 
aatatgatga 
acgatcgcca 
tgacggaagc 
aagtgaccag 
cgctggatgg 
aacagttgat 
cagtacgcgt' 
ggcagcagtg 
tcccgcatct 
aatttaaccg 
tgacgccgct 
aagcgacccg 
aggccgaagc 
cgaccgctca 
ggattgatgg 
cgcatccggc 
ggctcggatt 
gctgggatct 
tgcgctgcgg 



cttcgagcgg 
cattggtctt 
ggcgcagggt 
aatcgcccgc 
tggaaaccga 
gatctattaa 
agaagggtga 
ggtggggtgg 
gataatgttt 
tcattcctcc 
cttgattccc 
tgaagaacga 
aagttctaat 
acatttgtag 
cataaaatga 
taaagcaata 
ggtttgtcca 
ctacgccccc 
cgcttctgga 
ggtcgcgagt 
cgttacccaa 
agaggcccgc 
tgcctggttt 
ggccgatact 



caccaacgta 
gggttgttac 
aattattttt 
ttacggccag 
agaaaaccgc 
ggatatgtgg 
acaaatcagc 
ggaggctgaa 
atggcagggt 
tgagcgt'ggt 
gtggagcgcc 
cacgctgatt 
tctgctgctg 
tcctctgcat 
gcagaacaac 
gctgtgcgac 
ggtgccaatg 
aacgcgaatg 
tgaatcaggc 
tteccgcccg 
cccgatgtac 
caaaaaatgg 
cgcgatgggt 
ccgtttacag 
aaacggcaac 
gttctgtatg 
aaaacac.cag 
cgaatac'ctg 
taagccgctg 
tgaactgcct 
agtgcaaccg 
gcgtctggcg 
gaccaccagc 
ccagtcaggc 
gcgcgatcag 
cattgaccct 
agcgttgttg 
cgcgtggcag 
tagtggtcaa 
gcggattggc 
agggccgcaa 
gccattgtca 
gacgcgcgaa 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420' 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
57 60 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 



ttgaattatg 



caacagcaac 
ctgaatatcg 
tcggcggaat 
taataataac 
attggacaaa 
ataatgtgtt 
tgatgaatgg 
ttggacaaac 
ctattgcttt 
ttcattttat 
tctacaaatg 
ggtaaccgaa 
taagcgctag 
gtgtgggcgt 
cacagatttt 
ataggtagtc 
cctcggagta 
ctgcttgaga 
gaattttaat 
agccttatca 
gcttatccaa 
tgactcatga 
ctcaacactg 
tccttaattt 
ctcagtccag 
tccactccca 
aatggttcca 
ctgatatttt 
gcttactaaa 
gttcaggtgt 
aagcagtgct 
aaatgtgatt 
caagtaaaat 
gtgtgtgctt 
agagcacaga 
cacaaccatc 
attcacatta 
tgtctccagt 
caataatcaa 
agtgtaactg 
agactttaat 
atcctatttg 
aaagctaagt 
aaaatgtagg 
cagcactctg 
tcaagttcta 
tgagatttca 
atccgtctac 
gtcttttgac 
ctactaactt 
atttgcagcc 
taattttttt 
aggtggacta 
tatacttaac 
atacagaaac 
ttcttcacag 
gaatataaaa 
gcaacaagat 
atgtagtaat 
gcttgggtga 
caacctactt 
acttgctgag 
aacaaggcag 
tacacaggga 
gggacatgga 
gatttgtgaa 



gcccacaeea 
tgatggaaac 
acggtttcca 
tccagctgag 
cgggcagggg 
ctacctacag 
aaactactga 
gagcagtggt 
cacaactaga 
atttgtaacc 
gtttcaggtt 
tggtatggct 
gttcctatac 
cctagaagat 
tgtcctgcag 
cggttttgtc 
atctggggtt 
ttttccatcg 
tccttactac 
catttttaaa 
aaaggtattt 
cccctagaca 
aaccagacag 
cagttctttt 
tcagtgtcta 
ggagttttac 
tgaatgcctc 
ggtggatgtc 
aagacattaa 
attttgtcat 
ttgttgtctt 
ttctcttgga 
ttgccaagct 
gattaagcaa 
gtgctctata 
ctgctcttcc 
tgtaatggga 
aataaataaa 
agaatttact 
attactcttt 
tggacagagg 
gtcttttctc 
tttaaactgc 
ctgcagccat 
gccagagttt 
gaggcagaga 
tctaggatag 
taaaattata 
ctttctgatg 
actgtgggct 
cccatggctt 
tgatttccag 
tttaaaaaat 
atattaaatg 
ttttttttta 
tgttgcatcg 
ccaaagtcaa 
atgatagctt 
atgtagacta 
acttttggaa 
tagttggtaa 
tttaaaaaaa 
cacacaagag 
acaaccaaga 
tattaaaata 
tttctccggt 
attgttttca 



gtggcgcggc 
cagccatcgc 
tatggggatt 
cgccggtcgc 
ggatctttgt 
agatttaaag 
ttctaattgt 
ggaatgccag 
atgcagtgaa 
attataagct 
cagggggagg 
gattatgatc 
tttctagaga 
gggcgggagt 
gggaattgaa 
gggaagtttt 
ttatgcagca 
aggtagatta 
agtatgaaat 
gagcccagta 
tagaacactc 
gagcattggc 
attagttaca 
ataactcctt 
tcacctctcc 
aacaatagat 
tctccttttt 
tcctccccat 
aaggtatatt 
tgtacacatc 
tcctgaccta 
ctggcttgac 
tcttcaggac 
caaatgtatt ' 
ataatactat 
agaagtcctg 
tctgatgccc 
tcctccttct 
gaagtaatga 
aagcactgga 
agccataact 
ttacactaag 
tagctttact 
taetaaacat 
agccagccag 
caggcagatc 
ccaggaatac 
attgaagcat 
agatttgggt 
ttctttaaag 
aaatggcatg 
ggtggggttg 
gggttatata 
agtccctccc 
aatgtggtat 
cttaatcaga 
attaagaatt 
ttcctgaggc 
aagttctgcc 
cttgcaggtc 
aatgtgtttc 
aaagccaggc 
tagttacttg 
aactacagtt 
ttccaaataa 
gaataggcag 
agtgatagtt 



gacttccagt 
catctgctgc 
ggtggcgacg 
taccattacc 
gaaggaacct 
ctctaaggta 
ttgtgtattt 
atccagacat 
aaaaatgctt 
gcaataaaca 
tgtgggaggt 
tgcggccaaa 
ataggaactt 
cttctgggca 
caggtgtaaa 
ttaatagggg 
aaactacagg 
aagacatgct 
tacagtgtcg 
cttcatatcc 
attttagccc 
attttccctt 
tacaccacaa 
agtacacttt 
cgtcagtggt 
gtattgagaa 
ctccatttat 
attacctgat 
tcattattga 
tgtaaaaggt 
aggtcttgtg 
tcatggcatt 
ctataatttt 
tgtgaagctt 
ecaggggctg 
agttcaattc 
tcttctggtg 
tcttcttttt 
aatactttgt 
aatgttacca 
gcagacttgt 
caataaagaa 
taacttttgt 
gaaagcaagt 
tggtggtgct 
tctgagtttg 
acacagaaac 
tccctaatga 
attatttttt 
cctccttcct 
gctttttgcc 
ggaaatcttt 
ataaacctca 
ctataaaaga 
ctttagaacc 
ttttctagtt 
tctgactttt 
agggtctcac 
tgcttttgtc 
agattctttt 
aagtgatgaa 
ctgttagagc 
gcaggctcct 
aaggttacct 
tatttcattc 
agttggaaac 
aaagcccatg 



tcaacateag 
acgcggaaga 
actcctggag 
agttggtctg 
tacttctgtg 
aatataaaat 
tagattccaa 
gataagatac 
tatttgtgaa 
agttaacaac 
tttttaaagc 
tcggccggcc 
cggaatagga 
ggcttaaagg 
attggaggga 
caaataagga 
ttattattgc 
cacccgagtt 
cgagttagac 
atttctcccg 
cattttcatt 
tcctgatctt 
atcgaggctg 
ttgttgatcc 
gttccacatt 
tccaacctaa 
aaactgagct 
gtatcttaca 
gccacatggt 
ggttcctttt 
agcttgtatt 
ctacacgtta 
gcttgacttg 
ggtttttagg- 
gagaggtggc 
ccagcaacca 
tgtctgaaga 
ttttttttta 
gtttgttcca 
aggaactaat 
gggatacaga 
ataaaaattg 
gcttcatcta 
aatgataatt 
tgcctttatg 
agcccagcct 
cctgttgggg 
gccactatgg 
ctgtctctgc 
gccatgtggt 
ttctaagggc 
caaacactaa 
taaaatagtt 
gctattaagg 
aagggtctta 
tcaaatccag 
aatgttaatt 
tatgtatctc 
tcctgaatac 
ataggggaca 
aacttgaatt 
atgcttaagg 
ggtgagagca 



ccgctacagt 
aggcacatgg 
cccgtcagta 
gtgtcaaaaa 
gtgtgacata 
ttttaagtgt 
cctatggaac 
attgatgagt 
atttgtgatg 
aacaattgca 
aagtaaaacc 
taggcgcgcc 
acttcaagct 
ctaacctggt 
caagacttcc 
aaatgggagg 
ttgtgatccg 
ttatactctc 
tatgtaagca 
ctccttctgc 
tattatactg 
agaagtctga 
tagctggggc 
tttgccttga 
tgggcctatt 



gtctttaaac 
aagttttccc 
taaacaaatg 
agatacagaa 



agcttaactt 
attaaccatt 
tattgccagg 
attgattact 
ggaatgcaaa 
ttttctattt 
ttgctggtct 
tagccaaaca 
' ttgttgtgtt 
tcggagttca 
catggtggct 
ccacaagtgt 
aagagaatac 
atatggtagc 
ttttatttga 
agaccaatgc 
aacttctagt 
tacaaagctg 
ttggatttca 
cctttaatcc 
ggtctacaca 

a <jgggggctc 

atgtggctaa 
tgttggttgg 
ctcttgtttg 
agctgctgag 
aattgtcctt 
atgaggagtg 
ctttttgtct 
gagttttagt 
agaatccaaa 
tgcttactgt 
tgcctgatct 
ta'aggttaaa 
cactaaggga 
attatcaccg 
gatccctagg 
tatttcaaaa 
catctgcata 
ccatcaaatt 
ttggttttgt 
caaagctgct 



6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 



133 



10 



15 



atttcgaggt 
gatcatgtgc 
aggtgatgtc 
tacttaatgt 
aataagtcat 
agactctctg 
tcaagcattt 
ggacatgttt 
gattagcact 
tctcaaataa 
taatttttgt 
aagccatatt 
tatagccctg 
ccgcctgcct 
gttggatatt 
cagtcagtag 
cagaggctgt 
gggtcaggga 
tctgatagaa 



ctcttggttt 
taggcctacc 
atatgatttc 
gaaagttagg 
ttttacatgt 
acctagtaac 
caccttgaag 
atccagaaga 
gttagtgagc 
tgctggcctt 
teaaagaaat 
ttttttcctt 
gctgtcctgg 
ctgcctcctg 
ttgttatata 
tcttaagtgg 
tggtactagt 
tagaaactag 
atatttcagg 



atactcagaa 
ttaggctgat 
atatatcaag 
tctttgtggg 
cttacatttg 
cctacctata 
ttgagacgtt 
tattcaggac 
attgagtggc 
ttttaaaaag 
acttgtttgg 
tttttttttt 
aactcacttt 
agtgccggga 
actataacca 
tctttattgg 
ggcacttaag 
tctagcgttt 
acat 



gcacttcttt 
tgttgttcaa 
gcaaaacatg 
tttgattttt 
gtggaattgt 
gagcactttg 
ttgttagtgt 
tatttttgac 
ctttaggctt 
cccttgttct 
atctcctttt 
tttttggttt 
gtagaccagg 
ttaaaggcgt 
atactaactc 
cccttcatta 
caacttccta 
tgtataccta 



gggtttccct 

ataaacttaa 

ttatatatgt 

aattttcaaa 

ataattgtgg 

ctgggtcaca 

atactagttt 

tgggctaagg 

gaattggagt 

ttatcaccct . 

gacaacaata 

ttcgagacag 

ctggcctcga 

gcaccaccac 

cactgggtgg 

aaatctactg 

cggatatact 

ccagctttat 



gcactatcct 
gtttcctgtc 
taaacatttg 
acctgagcta 
tttgcaggca 
agtctaggag 
atatgttgga 
aattgattct 
cacttgtata 
gttttctaca 
gcatgttttc 
ggtttctctg 
actcagaaat 
gcctggctaa 
atttttaatt 
ttcactctaa 
agcagattaa 
actaccttgt 



10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11784 



<210> 107 
<211> 1458 
<212> DNA 

<213> Bacteriophage TP901-1 



<220> 
<221> CDS 
<222> (1). 



(1455) 



15 



40 



<400> 107 

atg act aag aaa gta gca ate tat aca cga gta tec act act aac caa 4 8 
Met Thr Lys Lys Val Ala lie Tyr Thr Arg Val Ser Thr Thr Asn Gin 
1 5 10 15 

gca gag gaa ggg ttc tea att gat gag caa att gac cgt tta aca aaa 96 
Ala Glu Glu Gly Phe Ser lie Asp Glu Gin He Asp Arg Leu Thr Lys 
20 25 30 

tat get gaa gca atg ggg tgg caa gta tct gat act tat act gat get 14 4 
Tyr Ala Glu Ala Met Gly Trp Gin Val Ser Asp Thr Tyr Thr Asp Ala 

35 4 0 |>45' . 



45 



50 



ggt ttt tea ggg gec aaa ctt gaa cgc cca gca atg caa aga tta ate 192 
Gly Phe Ser Gly Ala Lys Leu Glu Arg Pro Ala Met Gin Arg Leu He 
50 55 60 



aac gat ate gag aat aaa get ttt gat aca gtt ctt gta tat aag eta 
Asn Asp He Glu Asn Lys Ala Phe Asp Thr Val Leu Val Tyr Lys Leu 
65 70 75 . 80 



240 



55 



60 



gac cgc ctt tea cgt agt gta aga gat act ctt tat ctt gtt aag gat 288 
Asp Arg Leu Ser Arg Ser Val Arg Asp Thr Leu Tyr Leu Val Lys Asp 

8 5 90 ■' 95 

gtg ttc aca aaa aat aaa ata gac ttt ate teg ctt aat gaa agt att 336 
Val Phe Thr Lys Asn Lys He Asp Phe He Ser Leu Asn Glu Ser He 
100 105 110 

gat act tct tct get atg ggt age ttg ttt etc act att ctt tct gca 384 
Asp Thr Ser Ser Ala Met Gly Ser Leu Phe Leu Thr He Leu Ser Ala 
115 120 125 



att aat gag ttt gaa aga gag aat ata aaa gaa cgc atg act atg ggt 
65. He Asn Glu Phe Glu Arg Glu Asn He Lys Glu Arg Met Thr Met Gly 
130 135 140 



432 



aaa eta ggg cga gcg aaa tct ggt aag tct atg atg tgg act aag aca 480 
Lys Leu Gly Arg Ala Lys Ser Gly Lys Ser Met Met Trp Thr Lys Thr 

145 150 155 i 6 o 

get ttt ggg tat tac cac aac aga aag aca ggt ata tta gaa att gtt 528 
Ala Phe Gly Tyr Tyr His Asn Arg Lys Thr Gly He Leu Glu He Val 
165 170 175 

cct tta caa get aca ata gtt gaa caa ata ttc act gat tat tta tea 576 
Pro Leu Gin Ala Thr He Val Glu Gin He Phe Thr Asp Tyr Leu Ser 
180 185 190 

gga ata tea ctt aca aaa tta aga gat aaa etc aat gaa tct gga cac 624 
Gly He Ser Leu Thr Lys Leu Arg Asp Lys Leu Asn Glu Ser Gly His 
195 200 205 

ate ggt aaa gat ata ccg tgg tct tat cgt acc eta aga caa aca ctt 672 
lie Gly Lys Asp He Pro Trp Ser Tyr Arg Thr Leu Arg Gin Thr Leu 
210 215 220 

gat aat cca gtt tac tgt ggt tat ate aaa ttt aag gac age eta ttt 720 
Asp Asn Pro Val Tyr Cys Gly Tyr He Lys Phe Lys Asp Ser Leu Phe 
225 230 235 " 240 

gaa -ggt atg cac aaa cca att ate cct tat gag act tat tta aaa gtt 768 
Glu Gly Met His Lys Pro He He Pro Tyr Glu Thr Tyr Leu Lys Val 
245 250 255 



caa aaa gag eta gaa gaa aga caa cag cag act tat gaa aga aat aac 
Gin Lys Glu Leu Glu Glu Arg Gin Gin Gin Thr Tyr Glu Arg Asn Asn 
260 265 270 



816 



aac cct aga cct ttc caa get aaa tat atg ctg tea ggg atg gca agg 8 64 
Asn Pro Arg Pro Phe Gin Ala Lys Tyr Met Leu Ser Gly Met Ala Arg 
275 280 285 

tgc ggt tac tgt gga gca cct tta aaa att gtt ctt ggc cac aaa aga 912 
Cys Gly Tyr Cys Gly Ala Pro Leu Lys He Val Leu Gly His Lvs Aro 
290 295 300 

aaa gat gga age cgc act atg aaa tat cac tgt gca aat aga ttt cct 960 
Lys Asp Gly Ser Arg Thr Met Lys Tyr His Cys Ala Asn Arg Phe Pro 
305 . . 310 .315 320 

cga aaa aca aaa gga att aca gta tat aat gac aat aaa aag tgt gat 1008 
Arg Lys Thr Lys Gly He Thr Val Tyr Asn Asp Asn Lys Lys Cys Asp 
325 330 335 

tea gga act tat gat tta agt aat tta gaa aat act gtt att gac aac 1056 
Ser Gly Thr Tyr Asp Leu Ser Asn Leu Glu Asn Thr Val He Asp Asn 
340 345 350 

ctg att gga ttt caa gaa aat aat gac tec tta ttg aaa att ate aat 1104 
Leu He Gly Phe Gin Glu Asn Asn Asp Ser Leu Leu Lys He He Asn 
355 360 365 

ggc aac aac caa cct att ctt gat act teg tea ttt aaa aag caa att 1152 
Gly Asn Asn Gin Pro He Leu Asp Thr Ser Ser Phe Lys Lys Gin He 
370 375 380 

tea cag ate gat aaa aaa ata caa aag aac tct gat ttg tac eta aat 1200 
Ser Gin He Asp Lys Lys lie Gin Lys Asn Ser Asp Leu Tyr Leu Asn 
385 390 395 400 

gat ttt ate act atg gat gag ttg aaa gat cgt act gat tec ctt cag 1248 
Asp Phe lie Thr Met Asp Glu Leu Lys Asp Arg Thr Asp Ser Leu Gin 
405 _ 410 415 



10 



15 



i 



40 



# 



135 



get gag aaa aag ctg ctt aaa get aag att age gaa aat aaa ttt aat 1296 

Ala Glu Lys Lys Leu Leu Lys Ala Lys He Ser Glu Asn Lys Phe Asn 
420 425 430 

gac tct act gat gtt ttt gag tta gtt aaa act cag ttg ggc tea att 1344 
Asp Ser Thr Asp Val Phe Glu Leu Val Lys Thr Gin Leu Gly Ser He 
435 440 445 

ccg att aat gaa eta tea tat gat aat aaa aag aaa ate gtc aac aac 1392 
Pro He Asn Glu Leu Ser Tyr Asp Asn Lys Lys Lys He Val Asn Asn 
450 455 460. 

ctt gta tea aag gtt gat gtt act get gat aat gta gat ate ata ttt 14 40 
Leu Val Ser Lys Val Asp Val Thr Ala Asp Asn Val Asp He He Phe 
465 470 475 480 

'! aaa ttc caa etc get taa 1458 

Lys Phe Gin Leu Ala 
! 485 

' <210> 108 
<211> 485 

5 <212> PRT 

: <213> Bacteriophage TP901-1 

' <400> 108 

. Met Thr Lys Lys Val Ala He Tyr Thr Arg Val Ser Thr Thr Asn Gin 
P 1 5 10 15 

. Ala Glu Glu Gly Phe Ser He Asp Glu Gin He Asp Arg Leu Thr Lys 
! 20 25 ' 30 

'5 Tyr Ala Glu Ala Met Gly Trp Gin Val Ser Asp Thr Tyr Thr Asp Ala 
j 35 4 0 45 



55 



60 



Gly Phe Ser Gly Ala Lys Leu Glu Arg Pro Ala Met Gin Arg Leu He 

50 55 60 

Asn Asp He Glu Asn Lys Ala Phe Asp Thr Val Leu Val Tyr Lys Leu 

65 70 75 80 



Asp Arg Leu Ser Arg Ser Val Arg Asp Thr Leu Tyr Leu Val Lys Asp 

45 85 90 95 

Val Phe Thr Lys Asn Lys He Asp Phe He Ser Leu Asn Glu Ser He 

100 105 110 

50 Asp Thr Ser Ser Ala Met Gly Ser Leu Phe Leu Thr He Leu Ser Ala 

115 120 125 



He Asn Glu Phe Glu Arg Glu Asn He Lys Glu Arg Met Thr Met Glv 
130 135 140 

Lys Leu Gly Arg Ala Lys Ser Gly Lys Ser Met Met Trp Thr Lys Thr 
145 150 155 160 

Ala Phe Gly Tyr Tyr His Asn Arg Lys Thr Gly He Leu Glu lie Val 
165 170 175 

Pro Leu Gin Ala Thr He Val Glu Gin He Phe Thr Asp Tyr Leu Ser 
180 185 190 

65. Gly He Ser Leu Thr Lys Leu Arg Asp Lys Leu Asn Glu Ser Gly His 
195 200 205 



136 



lie Gly Lys Asp lie Pro Trp Ser Tyr Arg Thr Leu Arg Gin Thr Leu 
210 215 220 

Asp Asn Pro Val Tyr Cys Gly Tyr lie Lys Phe Lys Asp Ser Leu Phe 
5 225 230 235 240 

Glu Gly Met His Lys Pro lie lie Pro Tyr Glu Thr Tyr Leu Lys Val 
245 250 255 

10 Gin Lys Glu Leu Glu Glu Arg Gin Gin Gin Thr Tyr Glu Arg Asn Asn 
260 265 270 



15 



Asn Pro Arg Pro Phe Gin Ala Lys Tyr Met Leu Ser Gly Met Ala Arg 
275 280 285 

Cys Gly Tyr Cys Gly Ala Pro Leu Lys lie Val Leu Gly His Lys Arg 
290 295 300 

Lys Asp Gly Ser Arg Thr Met Lys Tyr His Cys Ala Asn Arg Phe Pro 
305 310 315 320 

Arg Lys Thr Lys Gly lie Thr Val Tyr Asn Asp Asn Lys Lys Cys Asp 
325 330 335 

Ser Gly Thr Tyr Asp Leu Ser Asn Leu Glu Asn Thr Val lie Asp Asn 
340 345 350 

Leu lie Gly Phe Gin Glu Asn Asn Asp Ser Leu Leu Lys lie lie Asn 

355 360 365 

Gly Asn Asn Gin Pro lie Leu Asp Thr Ser Ser Phe Lys Lys Gin lie 
370 375 380 

Ser Gin lie Asp Lys Lys lie Gin Lys Asn Ser Asp Leu Tyr Leu Asn 
385 ' 390 395 400 

Asp Phe lie Thr Met Asp Glu Leu Lys Asp Arg Thr Asp Ser Leu Gin 
405 410 415 



40 Ala Glu Lys Lys Leu Leu Lys Ala Lys lie Ser Glu Asn Lys Phe Asn 
420 • ■ 425 430 



45 



Asp Ser Thr '.Asp Val Phe Glu Leu Val Lys Thr Gin Leu Gly Ser lie 

435 440 ' ' 445 

Pro lie Asn Glu Leu Ser Tyr Asp Asn Lys Lys Lys lie Val Asn Asn 
450 455 * 460 



Leu Val Ser- Lys Val Asp Val Thr Ala Asp Asn Val Asp lie lie Phe 
50 465 470 475 480 



Lys Phe Gin Leu Ala 
485 



55 



